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(57) Abstract: The invention relates to a recombinant polypeptide comprising amino acid sequence derived from at least one of 
O an HTV gag gene product; an HTV pol gene product; or an HIV nef gene product, said sequence being mutated with respect to 
^ the natural sequence of said gene product, and said sequence maintaining each of the naturally occurring CD8+ T cell epitopes of 
Q said gene product as defined in pl7 and p24 (gag), amino acids 1-440 of RT (pol) and nef shown in Example 8. Furthermore the 
invention relates to nucleic acids encoding same, and viral vectors encoding same, and to their use in medicine and in immunisation 
and vaccination. 
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HIV Pharmaccines 

FIELD OF THE INVENTION 

5 The invention relates to polypeptides comprising HIV antigens, in particular to 
polypeptides comprising mutated HIV sequences, said mutated sequences retaining 
their naturally occurring CD8+ epitopes. Furthermore, the invention relates to mutated 
HIV sequences into which extra T helper epitopes have been introduced. 

10 BACKGROUND TO THE INVENTION 

HIV is a pathogenic virus leading to debilitating and fatal immune deficiencies such as 
AIDS. Although there are certain therapies for conditions such as AIDS which caji 
prolong life expectancy and increase quality of life for affected individuals, the disease 
1 5 is usually terminal. 

Eliminating or controlling the virus in HIV infected patients is a problem. 

Prevention of infection with HIV is a problem which has thus far been mostly 
20 countered by social solutions such as modification of sexual behaviour and/or greater 
take up of sterile practices for users of hypodermic needles. 

Medical prevention of HTV infection remains a high priority area of research. There is 
a need for effective HIV vaccination, strategies. 

25 

Even when immune responses are mounted against certain element(s) of an HTV 
particle, there are problems of immunodominance, viral escape and presentation of 
viral proteins which can lead to amelioration of the response. Clearly, it is extremeLy 
hazardous to render HIV particles for use in vaccines. Furthermore, use of proteins 
30 identical to natural proteins may itself be problematic if these proteins have 
undesirable effects. Furthermore, the proteins may themselves have 
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immunosuppressive qualities and/or may lack sufficient CD8+ T cell epitopes for a 
suitably strong or broad immune response to be provoked. 

Viral escape by mutation of CD8+ T cell epitopes, especially immunodominant CD8+ 
5 T cell epitopes, is a significant problem in viral vaccination. Existing vaccines can 
exhibit poor recruitment of T helper cells and hence produce a narrow and/or weak 
immune response. Natural viral sequences, subject to evolutionary pressures in vivo, 
may possess relatively few immunologically significant epitopes. Furthermore, 
immunodominant effects can skew the clinical importance of certain CD8+ T cell 
10 epitopes and focus the response (if any) on these. Clearly, this serves to accentuate 
immunodominance problems, for example leading to sub selection of viral lines which 
mutate at this epitope, leading to potentially fatal viral escape (eg. see Barouch et al. 
2002 Nature 415 p.335). 

15 WO02/32943 (Nabel and Huang) disclose modifications of HIV ENV, GAG and POL 
to enhance immunogenicity for genetic immunization. In particular, they focus on 
modification of glycosylation of ENV, and on nucleic acid constructs encoding delta 
CFI HIV ENV. Furthermore, specific deletions of ENV cleavage site, fusogenic 
domain, and spacing of heptad repeats 1 and 2 are disclosed. Immunization with DNA 

20 plasmids encoding GAG alone or GAG-POL is described, the best results being 
alleged for the gag-pol plasmid immunisation. The bulk of WO02/32943 is concerned 
with the disclosure of specific plasmids listed in table 1 and the claims of 
WO02/32943. Also claimed are 'analogs' of various parts of these plasmids, and 
segments having at least 95% sequence identity thereto. To disrupt functions of Nef 

25 such as limitation of MHC class I and/or CD4 expression in WO02/32943, point 
mutations were introduced into the Nef gene from HIV-1 PV22. PV22 corresponds 
with the HXB2 reference sequence (see example 8) at the positions of mutations. Thus 
it can be seen that the mutations of WO02/32943 destroy naturally occurring epitopes. 
For example, amino acid substitutions in the Nef polypeptide of construct VRC4301 

30 are: P69A, P72A, P75A, P78A, D174A and D175A. Each of the 6 mutations lies 
within a defined epitope with reference to Example 8, and therefore the naturally 
occurring epitopes have not been retained in this modified Nef gene. In constructs 
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VRC 4306, VRC 4309, VRC 4310, VRC 4311, VRC 4312 and similar construacts in 
WO02/32943, the reverse transcriptase mutation made (D771H) falls within the 
YMDD motif (shown in epitope map as RT 183-186) which is an area with a number 
of recorded epitopes. Thus these epitopes have not been retained in these prior art 
5 constructs. Furthermore, synthetic gag/pol/nef genes disclosed in WO02/32943 have 
been made by truncating fusions such as described for VRC 4312 and similar 
constructs. With reference to VRC 4312, the synthetic gag gene was ligated in frame 
with sequences encoding synthetic nef gene that 51 aa were deleted from 5'; 77 aa 
were deleted from 5' of pol polyprotein, and ligated with 3' of nef in which tag stop 
10 codon was deleted. These truncations of pol and nef result in loss of epitopes and so 
the naturally occurring epitopes have not been retained in terms of full length 
gag/pol/nef genes. 

WO02/022080 (Merck and Co.) disclose modifications of HIVl-Gag, Pol and Nef. 

15 The principal modifications described are codon optimisation of the Gag, Pol and Nef 
genes. There is no mention of mutation of Gag polypeptide, only codon optimisation 
of the nucleic acid encoding it. Example 17 of WO02/022080 presents various 
mutations of the Pol protein; nine point mutations are presented, three of which are 
localised to each of the reverse transcriptase, RNAse and integrase regions of the 

20 protein. At least the D 112 A, D187A, D188A, D445A and D5 00 A mutations of HIV-1 
Pol are within human epitopes and therefore the naturally occurring epitopes have not 
been retained. At least the 'LLAA' mutations of FHV-1 and LRV-1 Nef (ie. L164A, 
L165A double mutant) are within human epitopes and therefore the naturally occurring 
epitopes have not been retained in this mutated Nef polypeptide. 

25 

WO03/025003 discloses various altered gag gene constructs, and also mention nef and 
pol gene constructs. The nef gene constructs appear to be truncated removing epitopes 
from the N-terminus of nef. The gag gene constructs appear not to be mutated. 



30 The present invention seeks, inter alia, to overcome some of the problem(s) discussed 
above. 
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SUMMARY OF THE INVENTION 

The present invention is based on the design of particularly effective presentation of 
HTV derived CD8+ T cell epitopes. In this way, the strongest and broadest immune 
5 response can be produced. 

This is accomplished by altering the context of the polypeptide on which the epitopes 
are carried. In this way, the natural biological function(s) of the viral polypeptides can 
be destroyed, advantageously rendering them safe for vaccination use, whilst carefully 
10 preserving naturally occurring epitopes which prior art techniques are known to 
destroy. 

In this and other aspects, supplementary (ie. non-naturally occurring) T helper epitopes 
are also introduced into the antigenic polypeptides, thereby advantageously 
1 5 strengthening and broadening the immune response. 

Thus, in one aspect the invention provides a recombinant polypeptide comprising 
amino acid sequence derived from at least one of 
(i) an HIV gag gene product; 
20 (ii) an HTV pol gene product; or 

(iii) an HIV nef gene product, 

said sequence being mutated with respect to the natural sequence of said gene product, 
and said sequence maintaining substantially all of the naturally occurring CD8+ T cell 
25 epitopes of the corresponding part(s) of said gene product as defined in pl7 and p24 
(gag), amino acids 1-440 of RT (pol) and nef shown in Example 8. 

In another aspect, the invention relates to a recombinant polypeptide as described 
above comprising amino acid sequence derived from at least two of (i), (ii) and (iii). 
30 When the polypeptide comprises amino acid sequence derived from only two of (i), 
(ii) and (iii), preferably they are (i) and (iii) ie. gag and nef. 
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In another aspect, the invention relates to a recombinant polypeptide as described 
above comprising amino acid sequence derived from (i) and (ii) and (iii). 

Preferably each sequence derived from (i), (ii) and (iii) which the recombinant 
5 polypeptide comprises is mutated with respect to the natural sequence of said gene 
product. Preferably each said sequence maintains substantially all of the naturally 
occurring CD8+ T cell epitopes, preferably all said epitopes. 

In another aspect, the invention relates to a recombinant polypeptide as described 
10 above wherein the amino acid sequences derived from (i) and/or (ii) and/or (iii) are 
arranged in the order (i) - (ii) - (iii) from the N terminus to the C terminus of the 
polypeptide. 

In another aspect the invention relates to a recombinant polypeptide comprising amino 
1 5 acid sequence derived from 

(i) an HIV gag gene product; 

(ii) an HIV pol gene product; and 

(iii) an HIV nef gene product, 

said sequence being mutated with respect to the natural sequence of said gene product, 
20 and said sequence maintaining substantially all of the naturally occurring CD8+ T cell 
epitopes of said gene product as defined in pl7 and p24 (gag), amino acids 1-440 of 
RT (pol) and nef shown in Example 8, said polypeptide comprising amino acid 
sequence having at least 75% identity to SEQ ID NO:9. Preferably said polypeptide 
comprises amino acid sequence having at least 95% identity to SEQ ID NO:9. 

25 

In another aspect, the sequence identity is interepitope sequence identity. This means 
that the epitopes correspond with 100% sequence identity to the epitopes in the natural 
or reference sequence, and the sequence identity quoted is for the remaining non- 
epitope regions of the sequence ie. the interepitope regions. Preferably this is 
30 calculated across all interepitope regions present. Preferably 'epitope' refers to the 
linear amino acid sequence to which the epitope has been mapped, preferably with 
reference to the epitopes mapped in Example 8. 
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In another aspect, the invention relates to a recombinant polypeptide as described 
above comprising SEQ ID NO:9, or a sequence having at least 95% identity thereto. 

5 In another aspect, the invention relates to a recombinant polypeptide as described 
above further comprising an antibody recognition tag. 

In another aspect, the invention relates to a recombinant polypeptide as described 
above wherein said tag is an HA tag comprising the sequence as shown in SEQ ID 
10 NO:8. 

In another aspect, the invention relates to a recombinant polypeptide as described 
above further comprising a CD8+ T cell epitope tag. 

15 In another aspect, the invention relates to a recombinant polypeptide as described 
above wherein said tag is a gpl60 derived tag comprising the sequence as shown in 
SEQIDNO:7. 

In another aspect, the invention relates to a recombinant polypeptide as described 
20 above, said polypeptide comprising the sequence as shown in SEQ ID NO: 1 . 

In another aspect, the invention relates to a recombinant polypeptide as described 
above, said polypeptide comprising amino acid sequence derived from an HIV nef 
gene product, said recombinant polypeptide sequence being mutated to disrupt the 
25 function of said nef sequence, said nef sequence further comprising one or more T 
helper epitopes which are not present in the naturally occurring nef gene. 



30 



In another aspect, the invention relates to a recombinant polypeptide as described 
above comprising one or more T helper epitopes which are not present in the naturally 
occurring nef sequence and are shown in Figure 3 A. 
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In another aspect, the invention relates to a recombinant polypeptide as described 
above further comprising substantially all of the naturally occurring nef CD8+ T cell 
epitopes as defined in Example 8. 

5 In another aspect, the invention Telates to a recombinant polypeptide as described 
above further comprising substantially all of the naturally occurring nef T helper 
epitopes as defined in Example 8. 

In another aspect, the invention relates to a recombinant polypeptide as described 
10 above wherein said polypeptide comprises sequence as shown in SEQ ID NO:6, or a 
sequence having at least 95% identity thereto. 

In another aspect, the invention relates to recombinant polypeptide as described above, 
said polypeptide comprising amino acid sequence derived from an HIV pol gene 
15 product, said recombinant polypeptide sequence being mutated to disrupt the reverse 
transcriptase activity of the pol sequence, wherein substantially all of the CD8+ T cell 
epitopes of the naturally occurring pol sequence as defined in amino acids 1-440 of RT 
(pol) shown in Example 8 are retained in said recombinant polypeptide. 

20 In another aspect, the invention relates to a recombinant polypeptide as described 
above, wherein the reverse transcriptase activity of said pol sequence is mutated by 
duplication of an internal sequence derived from the centre of the naturally occurring 
pol gene and exchange of the amino and carboxy terminal portions of said pol 
sequence. 

25 

In another aspect, the invention relates to a recombinant polypeptide as described 
above wherein said duplicated internal sequence comprises TPDKKHQKEPPF (SEQ 
IDNO:4). 

30 In another aspect, the invention relates to a recombinant polypeptide as described 
above wherein said polypeptide comprises sequence as shown in SEQ ID NO: 12 or a 
sequence having at least 95% identity thereto. 
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In another aspect, the invention relates to a recombinant polypeptide as described 
above, said polypeptide comprising amino acid sequence derived from an HTV gag 
gene product, said recombinant polypeptide sequence being mutated to disrupt 
5 processing of the gag gene product, and said gag sequence further comprising a 
disrupted myristoylation site, wherein substantially all of the CD8+ T cell epitopes of 
the naturally occurring gag sequence as defined in pl7 and p24 (gag) shown in 
Example 8 are retained in said recombinant polypeptide. 

10 In another aspect, the invention relates to a recombinant polypeptide as described 
above wherein the processing of gag is disrupted by exchanging the pl7 and p24 
domains and wherein the myristoylation site is disrupted by mutation of the second 
glycine to alanine. 

15 In another aspect, the invention relates to a recombinant polypeptide as described 
above wherein said polypeptide comprises sequence as shown in SEQ ID NO: 13 or a 
sequence having at least 95% identity thereto. 

In another aspect, the invention relates to a recombinant polypeptide as described 
20 above wherein the HIV is a clade B HIV. 

In another aspect, the invention relates to a recombinant nucleic acid encoding a 
polypeptide as described above. 

25 In another aspect, the invention relates to a recombinant nucleic acid sequence 
comprising SEQ ID NO:ll, or a sequence which differs only by silent mutations with 
respect to the genetic code, or a sequence having at least 95% identity thereto. 

In another aspect, the invention relates to a viral vector encoding a polypeptide as 
30 described above. 
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In another aspect, the invention relates to a viral vector as described above wherein 
said vector is an MVA or MVA derived vector. 

In another aspect, the invention relates to a viral vector as described above wherein 
5 said vector is a fowlpox or fowlpox derived vector. 

In another aspect, the invention relates to a viral vector as described above wherein 
said vector is a FP9 fowlpox vector. Specific teachings with regard to this vector may 
be found in WO03/047617 which is incorporated herein by reference. 

10 

In another aspect, the invention relates to the use of a polypeptide as described above 
in medicine. 

In another aspect, the invention relates to the use of polypeptide as described above in 
1 5 the preparation of a medicament for the treatment or prevention of HTV infection. 

In another aspect, the invention relates to the use of polypeptide as described above in 
the preparation of a medicament for immunisation against HIV infection. 

20 In another aspect, the invention relates to the use of a nucleic acid as described above 
in medicine. 

In another aspect, the invention relates to the use of nucleic acid as described above in 
the preparation of a medicament for the treatment or prevention of HTV infection. 

25 

In another aspect, the invention relates to the use of nucleic acid as described above in 
the preparation of a medicament for immunisation against HIV infection. 



30 



In another aspect, the invention relates to a method of immunising a subject against 
HTV infection comprising administering to said subject a polypeptide or nucleic acid 
as described above. 
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In another aspect, the invention relates to the use of a polypeptide or nucleic acid as 
described above as a priming agent or as a boosting agent in a prime-boost 
immunisation regime. Prime boost immunisation is well known in the art, and specific 
teachings on this subject may be taken from WO98/056919, which is incorporated 
5 herein by reference. 

In another aspect, the invention relates to the use of a polypeptide or nucleic acid as 
described herein in the induction of an immune response. Said immune response may 
be, for example, a cellular immune response, such as a CD8+ or CD4+ response, or a 
10 humoral (antibody) response. The invention moreover provides a method for eliciting 
an immune response in a subject comprising administering to said subject, which may 
be in need of such administration, a polypeptide, nucleic acid or vector as herein 
described. 

15 In another aspect, the invention relates to a nucleic acid vector comprising a nucleic 
acid sequence as described above or encoding a polypeptide as described above. 

In another aspect, the invention relates to an adenovirus vector comprising a nucleic 
acid sequence as described above or encoding a polypeptide as described above. 

20 

In a further aspect, the invention relates to a vector based on VS V (vesicular stomatitis 
virus), adeno-associated virus (AAV), alphavirus, Sendai virus or Herpes Simplex 
virus comprising a nucleic acid sequence as described above or encoding a polypeptide 
as described above. 

25 

In another aspect, the invention relates to a poxvirus vector comprising a nucleic acid 
sequence as described above or encoding a polypeptide as described above. 

In another aspect, the invention relates to a plasmid selected from the group consisting 
30 of p29D.gpn, pOPK6.gpn and pSG2.gpn. 
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DETAILED DESCRIPTION OF THE INVENTION 
DEFINITIONS 

5 As used herein, the term "adenovirus" comprises the members of the Adenoviridae 
(adenovirus family). This family, in turn, comprises three genera: Mastadenovirus, 
Aviadenovirus and ATadenovirus. In particular, the invention contemplates the use of 
ovine adenovirus (an ATadenovirus). 

10 A "CD8+ T cell epitope" is an amino acid sequence which is a peptide recognised by 
CD8+ T cells usually in conjunction with a class I major histocompatibility complex. 
In particular, reference to "all" CD8+ T cell epitopes, and/or "all known" CD8+ T cell 
epitopes, refers to currently known epitopes, as defined in Example 8 hereto, HIV 
Molecular Immunology 2002: Maps of CTL Epitope Locations Plotted by Protein; 

15 Theoretical Biology & Biophysics, Los Alamos National Laboratory, August 7, 2003. 
In accordance with the present invention, substantially all human CD8+ T cell epitopes 
are retained in the modified polypeptides; however, CD8+ T cell epitopes relevant in 
other mammalian species, such as murine CD8+ T cell epitopes, may be lost. CD8+ T 
cells are synonymous with CTLs (cytotoxic T-lymphocytes). 

20 

A "T helper epitope", likewise, is a peptide recognised by T helper cells usually in 
conjunction with a class II major histocompatibility complex; "all" and/or "all known" 
T helper cell epitopes are as defined in Example 8, HIV Molecular Immunology 2002: 
Maps of CTL Epitope Locations Plotted by Protein; Theoretical Biology & 
25 Biophysics, Los Alamos National Laboratory, August 7, 2003. In accordance with the 
present invention, substantially all human T helper cell epitopes are retained in the 
modified polypeptides; however, T helper epitopes relevant in other mammalian 
species, such as murine T helper epitopes, may be lost. A T helper cell is synonynous 
with a helper T cell. 

30 

"Substantially all" means at least 99%; preferably, it means at least 98%, 97%, 96%, 
95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86% or 85%. In an alternative 
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embodiment, 'substantially all' refers to all except one, all except two, all except three, 
all except four, all except five, all except six, all except seven, all except eight, all 
except nine, all except ten, all except eleven, all except twelve, all except thirteen, all 
except fourteen, all except fifteen. 

5 

An epitope is considered to be 'lost' or 'destroyed' or 'not retained' if the peptide 
sequence to which it is mapped, for example with reference to Example 8, is mutated. 
Preferably all epitopes are retained, with reference to those shown in Example 8. 

10 A 'natural' sequence or 'reference' sequence as used herein refers to the source 
sequence from which the claimed polypeptide or nucleotide sequence(s) are designed 
or derived. Wherever possible this term should take its ordinary meaning ie. referring 
to the sequence of the corresponding gene(s) in a virus found in nature. However, it 
will be apparent to a person skilled in the art that there is not a single HIV virus found 

15 in nature, but rather there are many different clades and many different isolates or 
clones within those clades. Therefore, when a particular sequence according to the 
invention has been designed or derived from a particular clone or isolate of HTV, then 
the 'natural' or 'reference' sequence refers to that particular clone or isolate's own 
sequence. However, there are numerous consensus sequences known in the art which 

20 have been formed and indeed are sometimes updated by the comparison or 'pileup' of 
a number of different isolate or clonal sequences which may be divergent at one or 
more nucleotide or amino acid positions. In the situation where a construct according 
to the present invention has been designed or derived from such a consensus sequence, 
then this consensus sequence will be taken to be the natural or reference sequence. 

25 

Preferably all epitopes for the 'full length' g/p/n gene products as described herein are 
retained. Preferably all said epitopes are retained even when non-epitope containing 
regions of the gene product are truncated or deleted. 

30 The natural or reference sequence should preferably be considered on a gene-by-gene 
basis. For example, a construct according to the present invention could comprise a 
gag gene, a nef gene and a pol gene. These genes will each have a natural or reference 
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sequence. It may be that the natural or reference sequences for each gene will be from 
the same source eg. the same overall consensus sequence, or it may be that the natural 
or reference sequence for each will be from a different source, eg. the gag and pol 
reference sequences may be from the 2001 clade B consensus sequence whilst the nef 
5 reference sequence may be from the 2000 clade B consensus sequence or vice versa. 
Reference sequences (both consensus and clonal) may be found in the Los Alamos 
Database as discussed herein (HIV Molecular Immunology 2002: Maps of CTL 
Epitope Locations Plotted by Protein; Theoretical Biology & Biophysics, Los Alamos 
National Laboratory, August 7, 2003). Specifically the 2001 clade B consensus 
10 sequence may be found at http://www.hiv.laiil.gov/content/hiv-» 
db/ALIGN 02/ALIGN-INDEX.html and the 2000 clade B consensus sequence may be 
found at http://www.hiv.lanl.gov/content/hiv-db/ALIGN 0 1/ALIGN-INDEX.html . 

"HIV" is Human Immunodeficiency Virus, a virus that causes immunodeficiency by 
1 5 attacking CD4+ cells in the body. The term "HIV", as used herein, includes any HIV, 
including all groups and subtypes (clades) of HIV-1 and HIV-2, for example HIV-1 M 
and HIV-1 O groups; the invention embraces each of the known clades; Clade B HIV- 
1 is preferred. Gag, pol and nef gene products are well known in the art and are as 
defined, for instance, in Example 8 hereto. 

20 

Guidance on the application of the invention to different clades may also be found in 
HIV Sequence Compendium 2002 Kuiken C, Foley B, Freed E, HahnB, Marx P, 
McCutchan F, Mellors J, Wolinsky S, and Korber B,editors. Published by the 
Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, LA-UR 

25 number 03-3564, incorporated herein by reference. In particular, consensus sequence 
data for HIV-1 clades A, B, C and D and a number of isolates can be found on pages 
490 to 550; consensus sequence data for HIV-2 clades A, B, C and D arid a number of 
isolates can be found one pages 554 to 578. Preferably the HIV is HIV-1 or HIV-2, 
most preferably HIV-1. Preferably the HIV is of clade A, B, C or D, preferably of 

30 clade B or D, preferably of clade B. An HIV sequence can be aligned to the HXB2 
reference sequence (example 8), and therefore the g/p/n genes can be identified and 
corresponding epitopes delineated, no matter which clade is used. This can be done by 
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eye, but is preferably done using the Los Alamos database's "HXB2 Numbering 
Engine" for this purpose rhti p://ww.tiv.lanl.gov/c^^ 
HXB27HXB2.MAIN.htmn , 

5 A "mutation", as referred to herein, encompasses any addition, deletion or substitution 
of amino acids in a polypeptide or nucleic acid. Mutations, in general, alter the amino 
acid sequence of the polypeptide in question such that it differs from a or the naturally 
occurring polypeptide sequence. 

10 In the context of the present invention, mutations are often introduced in order to 
abrogate or ameliorate a known biological activity of the viral protein. Thus, 
preferably when a sequence is said to be mutated with respect to the natural sequence, 
this means mutated to reduce or preferably remove one or more biological activities of 
the viral polypeptide. Examples of specific mutations which result in the reduction or 

15 removal of particular viral biological activities are presented herein. Further examples 
of motifs or domains associated with viral functions are known in the art. However, in 
accordance with the present invention, it is important that when the mutations are 
made to reduce/remove the biological functions, the naturally occurring epitopes are 
preserved as explained herein. 

20 

A "recombinant" polypeptide, as referred to herein, is a peptide whose sequence 
differs from a or the naturally occurring equivalent polypeptide and which may be 
produced by genetic recombination technologies, including DNA synthesis and 
manipulation. Recombinant peptides also includes peptides which are not produced by 
25 recombinant means, but which have designed using recombinant DNA technology or, 
preferably, have a sequence identical to a peptide designed by such technology. 



An "immune response" as referred to herein, is either a cellular (to include, but not 
limited to, CD4+ and CD8+) or humoral response to an antigenic sequence or a 
30 combination of both. 
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The gene names used herein to define segments of nucleic acid and/or polypeptide 
sequence such as 'gag', 'poP, 'nef and the like preferably have their ordinary 
meaning. Most often the terras are used to describe the polypeptide sequence, or the 
con-esponding nucleotide sequence encoding said polypeptide sequence, 

5 

A "protective immune response" as referred to herein, is an antigen specific immune 
response that provides a prophylactic and/or therapeutic benefit. 

VIRAL ESCAPE 

10 

Mutations can occur in any epitope and lead to viral escape. By providing a greater 
number of epitopes it is more likely that some of the epitopes will remain unmutated. 
Furthermore, by advantageously providing extra T helper epitopes, the immune 
response can be broadened and/or strengthened according to the present invention. 

15 

Thus, in one embodiment, the present invention advantageously counters 
immunodominant effects which can affect conventional vaccines. 

It is demonstrated that all of the identified nef human epitopes (both CD8+ T cell and 
20 T helper epitopes) in the HIV molecular immunology database are present in the GPN 
sequence of the present invention (SEQ ID NO: 1). 

As noted above, WO02/32943 (Nabel and Huang) makes various disclosures in the 
field of HIV vaccines. As will be apparent from this specification, the present 
25 invention is distinct from disclosures made therein. In preferred embodiments, the 
present invention relates to materials comprising and/or encoding HIV derived 
proteins which expressly exclude any of those disclosed in WO02/32943, and 
preferably exclude any having 95% or greater identity to those disclosed in 
WO02/32943. 

30 



As noted above, WO02/022080 (Merck and Co.) makes various disclosures in the field 
of HIV vaccines. As will be apparent from this specification, the present invention is 
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distinct from disclosures made therein. In preferred embodiments, the present 
invention relates to materials comprising and/or encoding HIV derived proteins which 
expressly exclude any of those disclosed in WO02/022080, and preferably exclude any 
having 95% or greater identity to those disclosed in WO02/022080. 

5 

As noted above, WO03/025003 makes various disclosures in the field of HIV. As will 
be apparent from this specification, the present invention is distinct from disclosures 
made therein. In preferred embodiments, the present invention relates to materials 
comprising and/or encoding HIV derived proteins which expressly exclude any of 
10 those disclosed in WO03/025003, and preferably exclude any having 95% or greater 
identity to those disclosed in WO03/025003. 

CE>8+ T CELL EPITOPES 

15 CE>8+ T cell epitopes can be identified experimentally and can be predicted by 
analysis of the sequence of interest. Preferably these epitopes are predicted/recognised 
using the ProPred program (epitope prediction program, employing a matrix based 
prediction algorithm as disclosed in Sturniolo et al. Nat. Biotechnol. 17. 555- 
561(1999) and Singh and Raghava (2001) Bioinformatics,17(12), 1236-37, such as 

20 may be found at (http://www.imtech.res.in/raghava/propred/) ). 

It is an advantage of the present invention that naturally occurring epitopes are 
preserved in the polypeptide(s) of interest and in nucleic acids encoding them. 

25 Addition or introduction of new CD8+ T cell epitopes may occur in the process of 
mutation and gene construction. 

T HELPER EPITOPES 

30 T helper epitopes can be identified experimentally and can be predicted by analysis of 
the sequence of interest. Preferably these epitopes are predicted/recognised using the 
ProPred program (epitope prediction program, employing a matrix based prediction 
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algorithm as disclosed in Sturniolo et al. Nat. Biotechnol. 17. 555-561(1999) and 
Singh and Raghava (2001) Bioinformatics,17(12), 1236-37, such as may be found at 
rhttp://ww.imtech.res.in/raghava/propred/) > ). 

5 Preferably, in the present invention naturally occurring T helper epitopes are preserved 
in the polypeptide(s) of interest and in nucleic acids encoding them. 

Furthermore, the present invention advantageously provides novel T helper epitopes 
which have the beneficial effect of boosting and/or broadening the immune response to 
10 an antigen bearing said epitopes. In a preferred embodiment of the present invention, 
increased numbers of T helper epitopes are provided. 

In another preferred embodiment of the invention, new T helper epitopes are created 
and/or introduced into the polypeptides of the present invention, thereby enhancing the 
1 5 immune response. 

In some embodiments, the invention comprises a recombinant strain FP9 fowlpox 
and/or a recombinant modified vaccinia virus Ankara, (MVA), each expressing a novel 
fusion protein containing antigenic peptide sequences found in the translation products 
20 of the gag, pol and nef genes of human immunodeficiency virus type I, (HIV-1), 
preferably clade B. 

The invention also relates to the use of such vectors in vaccination methods such as 
prime-boost (including single prime-multiple boost versions of prime-boost). Indeed, 

25 as discussed below, the use of either or both of these two recombinant viruses to boost 
a DNA plasmid-mediated prime has been shown to induce a strong immune response 
in mammals such as rodents and finds application in primates such as humans. The 
invention also relates to therapeutic immunotherapy for people infected with HIV-1, 
for example using the antigenic gene(s) of the present invention in combination with 

30 HAART, and/or a prophylactic immunotherapy for people at risk of infection with 
HIV-1. 
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Advantageously, vectors according to the invention employ appropriate codon usage to 
optimise protein expression from mammalian cells. Advantageously, human codon 
usage is employed. 

5 In a preferred embodiment, therapeutic antigen(s) of the present invention comprise a 
fusion protein based on the products of the products of the HIV-1 (preferably clade-B) 
gag, pol and nef genes. Preferably said antigen(s) are delivered by one or both of the 
two recombinant pox viruses described herein. 

10 In one aspect the invention provides a novel fusion protein containing antigenic 
peptide sequences derived from the translation products of the gag, pol and nef genes 
of human immunodeficiency virus type I, (HIV-1), preferably clade B. 

A preferred amino acid sequence of the fusion protein is shown in Fig 1. A preferred 
15 nucleotide sequence encoding this amino acid sequence is shown in Figure 6. 
Naturally a person skilled in the art will appreciate that due to degeneracy of the 
genetic code, numerous possible nucleotide sequences are possible and this is only one 
preferred example of same. 

20 The gene products are preferably in the order GAG-POL-NEF in the fusion protein. 
This is also the way they are arranged in the HIV genome. Other orders may be 
equally effective. It is well within the abilities of a person skilled in the art to alter the 
order to meet the needs of a particular application, or even merely to facilitate an 
easier construction and/or handling of the reagent(s). 

25 

In a preferred embodiment, the constructs of the present invention possess 
advantageous effects regarding immunodominance. Immunodominance has hindered 
prior art vaccines, especially with respect to gag epitope(s). This is discussed above. 
Three epitopes in GAG in a preferred GPN construct according to the present 
30 invention are studied in the example section and each mounted an immune response. It 
is further disclosed in a preferred embodiment how the immune response across the 
entire fusion protein is monitored (see below for more details). 
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It will be apparent that some of the GPN protein could be deleted, for example to 
reduce the size of the construct. Indeed, certain individual gene(s) are presented herein 
for individual application. Preferably the gpn constructs disclosed are not truncated. 
5 Without wishing to be bound by theory, it is suggested that a single protein may be 
less immunogenic if used alone as a smaller entity rather than used as disclosed within 
a gene. Thus, in a preferred embodiment, gpn genes disclosed herein are used without 
truncation/deletion. Preferably the g/p/n genes used are each full length. 

10 It will be apparent to the skilled reader that the immunogenic components of the 
present invention are generally polypeptides. However, the invention relates both to 
these polypeptides and to nucleic acids encoding them, whether these take the form of 
DNA, plasmids, viral vectors or other such entities. Indeed, the actual mode of 
production of the antigenic polypeptides may be varied by a person skilled in the art 

15 according to the particular application to which the invention is being put. The 
polypeptides may be made in vitro or preferably in vivo, whether intra- or extra- 
cellularly. Specific modes of use are described herein. 

GAG 

20 

The GAG protein has two functional domains, pi 7 matrix protein and p24 core capsid 
protein, both of which are essential and sufficient for assembly of HIV virus-like 
particles. In the gag gene(s) of the present invention, the p 17 and p24 domains have 
advantageously been exchanged to disrupt processing of the GAG protein, in 
25 accordance with the International Aids Vaccination Initiative (WO01/47955). Other 
disruptional rearrangements are possible, such as scrambling as was applied to nef (see 
below), so long as the naturally occurring CD8+ T cell epitopes are preserved. 

The inyristoylation site of the pl7 domain is advantageously mutated in order to 
30 prevent binding of the GAG protein to cellular membranes and, consequently, 
assembly and budding. This mutation is preferably by deletion or replacement of the 
second glycine residue in the domain with another amino acid, more preferably 
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replacement with an inert non-polar amino acid, most preferably by replacement of the 
second glycine residue in the domain with an alanine residue. This prevents GAG 
from functioning in binding to cellular membranes, in virus assembly and in budding. 

5 Advantageously the immunogenicity of gag is not adversely altered in the recombinant 
gene(s) of the present invention. 

Preferably 'full length' gag according to the present invention means comprising the 
full sequence of the pl7 and p24 domains. Preferably full length gag comprises only 
10 the full sequence of the pl7 and p24 domains. 

POL 

The pol gene encodes a number of proteins, including a protease, (plO), reverse 
1 5 transcriptase, (p5 1), RNase H, (pi 5) and an integrase, (p3 1). 

The functional amino acid residues for reverse transcriptase activity centre on the 
YMDD motif (shown in epitope map as RT 183-186) which is an area with a number 
of recorded epitopes. Advantageously the reverse transcriptase is inactivated without 

20 loss of epitopes by domain swapping mutation as described in more detail below. 
Thus, in the present invention, preferably the reverse transcriptase activity present in 
the P51 subunit is disrupted. This is preferably done by rearrangement of the encoding 
sequence such that an altered P51 polypeptide is made with disrupted RT activity. 
Most preferably the sequence TPDKKHQKEPPF (which is present close to centre of 

25 the P51 polypeptide), is duplicated and the sequence encoding the amino- and 
carboxy-terminal parts of the P51 polypeptide exchanged about the point between the 
duplicated sequences. 

This novel technique disrupts the active site in POL to ameliorate reverse transcriptase 
30 activity. Furthermore, this is advantageously accomplished according to the present 
invention while retaining the immunogenicity of the gene product. If the active site is 
mutated eg. by insertion, deletion or other known replacement type mutation, a T cell 
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epitope would be destroyed. According to the present invention such epitope(s) are 
retained. 

Preferably 'full length' pol according to the present invention means comprising all of 
5 the p51 domain of pol (ie. aa 1-440) which is often known as Reverse Transcriptase 
(RT). In Example 8, RT and RNase are combined in the section named RT CTL Map; 
appropriate care should be taken when reading this Example. Sometimes this may be 
referred to as '1-440 of RT'. Sometimes aa 1-440 of pol is referred to as 'truncated 
pol' (p51) to indicate that other parts of the full viral pol ORF are omitted such as the 
10 RNAse unit. However, references to 'pol', 'truncated pol' (p51) and 'full length pol' 
preferably mean pol p51 amino acids 1-440 as used herein and as will be apparent 
from the context. 

NEF 

15 

The nef gene encodes a multifunctional protein which enhances virus growth and 
mediates immune evasion. The NEF protein encoded by the gene in the current 
invention has been inactivated (ie. its function disrupted). In a preferred embodiment 
this inactivation is by dividing the coding region into eight subregions which were then 
20 reordered. Preferably these eight regions comprise 6 regions of 26 amino acid and two 
regions of 25 amino acids. A highly preferred scrambled NEF sequence is presented 
below (see Figure 2). Exact junctional boundaries in this sequence are easily 
identified by comparison to the native nef sequence (also presented below). 

25 The objective of scrambling according to the present invention is to disrupt the 
functions of nef whilst preserving its T cell epitopes. 



In scrambling nef according to the present invention, attention should be paid to 
disrupt the function of nef whilst creating a minimal number of surplus CD8+ T cell 
30 epitopes. 
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The preferred choice of 8 sub regions disclosed herein represents an advantageous 
balance between maintaining CD8+ T cell epitopes and not creating too many new 
junctional CD8+ T cell epitopes. In a preferred embodiment, creation of new 
junctional CD8+ T cell epitopes is minimised. In this context 'new' CD8+ T cell 
5 epitopes are those not occurring in the natural nef sequence. The average length of a 
CD8+ T cell epitope is 9 amino acids. 

The preferred GPN fusion protein according to the present invention is mapped to 
show where epitopes have been maintained and where new junctional epitopes have 
10 been made. This is discussed further below. 

When making a scrambled gene according to the present invention, it is advantageous 
to introduce junctional linkers in order to maintain any epitopes which span the chosen 
junctions. This may be easily accomplished by review of the sequence(s) using the 
15 ProPred tools and making correlating modifications as necessary. 

In a preferred embodiment, twelve residue linkers, spanning these subregion junctions, 
are inserted between the re-ordered subregions to restore T cell epitopes located at the 
junctions of the eight subregions in the original NEF protein sequence. 

Preferably 'full length' nef according to the present invention means comprising all of 
the amino acid sequence of the nef gene. Preferably this includes the N-terminal part 
(ie. nef aa 1-65) of the nef polypeptide which contains epitopes but has been deleted 
from prior art nef constructs. 

SEQUENCES 

Specific sequences are presented in the sequence listing. In brief, SEQ ID NO:l 
comprises a full exemplary gpn gene sequence, including tags. In a preferred 
30 embodiment, the tags are removed such as for human use, and an exemplary core GPN 
sequence is presented in SEQ ID NO:9. 
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SEQ ID NO:2 is the p24 gag fragment, SEQ ID N0:3 is the p!7 gag fragment. The 
GAG p24 construct begins M A P I V even though the natural or reference p24 
sequence does not contain M A. This is due to the preferred Kozak sequence used for 
translation (GCC GCC ACC ATG G ; SEQ ID NO: 14). ATG is the initiation codon 
5 and translates to Methionine (M). The DNA codons for Proline (P) begin with a C so 
an additional amino acid having DNA codon(s) beginning with G was inserted 
(alanine). 

In one embodiment, SEQ ID NO:2 and SEQ ID NO:3 are joined (last residue of SEQ 
10 ID NO:2 to first residue of SEQ ID NO:3) to make a preferred gag gene according to 
the present invention. 

SEQ ID NO: 4 represents a short pol p51 sequence which is advantageously duplicated 
in a preferred pol construct according to the present invention as explained herein. 
15 SEQ ID NO : 5 is the N and C terminal pol sequence. In one embodiment, a larger 
sequence is constructed in the order SEQ ID NO:4- SEQ ID NO:5- SEQ ID NO:4, 
resulting in duplication of SEQ ID NO:4, one repeat each side of SEQ ID NO:5 (ie. 
one at the N-teimiiius and one at the C-terminus of SEQ ID NO:5), thereby making a 
preferred pol gene according to the present invention. 

20 

SEQ ID NO: 6 is a scrambled nef according to the present invention. 

SEQ ID NO: 7 is a tag recognised by murine CD8+ T cells, and SEQ ID NO:8 is an 
HA tag. Either or both of these tags may be advantageously incorporated into gene 

25 sequences of the present invention, for example to monitor expression and/or to 
monitor immune response(s) and/or to monitor for persistence/processing of the 
protein via the N- or C- termini (preferably via the C-terminus). However, for 
embodiments involving human subjects, preferably these tags are not incorporated into 
the gene sequence(s) of the present invention, or are removed from or not expressed on 

30 said gene sequence(s). 
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SEQ ID NO:10 is the native HIV nef sequence of the HIV clade B consensus sequence 
protein. 

SEQ ED NO:l 1 is an exemplary nucleotide sequence coding for GPN. Naturally the 
5 person skilled in the art will appreciate that due to the degeneracy of the genetic code, 
many variant nucleotide sequences could equally code for the GPN of the present 
invention, especially those related to SEQ ID NO: 11 and varying only by 
translationally silent differences in nucleotide sequence. 



10 As used herein the term sequence 'derived from' an HIV gene product has its natural 
meaning in the art. Derived from simply indicates that it is based on the HIV sequence 
eg. as a starting point, whether this is in silico or actually experimental derivation eg. 
by cloning, PCR etc. For example the term would include predicted amino acid 
sequence from an HIV ORF, and is not limited to experimentally derived sequence. If 

1 5 a skilled person can recognise that the sequence originates from HIV, however much it 
has been rearranged or mutated, then it will be considered to be 'derived from* HIV. 
In a preferred embodiment, the sequence derived from another will possess a number 
of contiguous residues (amino acid or nucelotide) identical to the sequence from which 
is derived. Preferably there will be at least 5 such residues, preferably at least 8 such 

20 residues, preferably at least 10 such residues, preferably at least 14 such residues, 
preferably at least 18 such residues, preferably at least 20 such residues, preferably at 
least 22 such residues, preferably at least 25 such residues. For example, the gag, pol 
and nef sequences given herein are each derived from HIV. 



25 The term 'mutated' has its usual meaning and includes deletion, insertion, point 
mutation, truncation, inversion as well as site directed mutation and scrambling as 
described herein. 



It will be appreciated that the invention also embraces nucleic acid fragment(s) of the 
30 disclosed nucleotide sequences. Preferably nucleic acid fragments comprise at least 40 
nucleotides, preferably at least 50 nucleotides, preferably at least 100 nucleotides, 
preferably at least 200 nucleotides, preferably at least 400 nucleotides, preferably at 
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least 600 nucleotides, preferably at least 800 nucleotides, preferably at least 1000 
nucleotides, preferably at least 1500 nucleotides, preferably at least 2000 nucleotides, 
preferably at least 2500 nucleotides, preferably at least 3000 nucleotides, preferably at 
least 3400 nucleotides, preferably at least 3410 nucleotides. 

5 

It will be appreciated that the invention also relates to nucleic acids which differ from 
those presented only by virtue of the degeneracy of the genetic code. The open 
reading frame(s) of importance when judging variation connected to degeneracy of the 
genetic code will be those which encode polypeptide(s) of the present invention, 
10 particularly encoding core GPN (SEQ ID NO:9 and nucleic acids encoding it). 

Unnatural nucleotide residues or analogues or derivatised moieties may feature in 
nucleic acids according to the present invention. 

15 It will be appreciated that the invention also embraces polypeptide fragment(s) of the 
disclosed amino acid sequences. Preferably fragments are considered on a gene-by- 
gene basis. Preferably polypeptide fragments comprise at least 8 amino acids, 
preferably at least 9 amino acids, preferably at least 10 amino acids, preferably at least 
12 amino acids, preferably at least 15 amino acids, preferably at least 20 amino acids, 

20 preferably at least 25 amino acids, preferably at least 26 amino acids, preferably at 
least 30 amino acids, preferably at least 40 amino acids, preferably at least 60 amino 
acids, preferably at least 80 amino acids, preferably at least 100 amino acids, 
preferably at least 150 amino acids, preferably at least 200 amino acids, preferably at 
least 300 amino acids, preferably at least 400 amino acids, preferably at least 600 

25 amino acids, preferably at least 800 amino acids, preferably at least 1000 amino acids, 
preferably at least 1 108 amino acids, preferably at least 1 125 amino acids. 

It will be appreciated that when considering fragments of the polypeptides and nucleic 
acids of the present invention, the same importance is attached to the preservation of 
30 epitopes. That is to say, if only a part of a gene (such as amino acids 1-440 of reverse 
transcriptase (pol)) are present in a construct according to the present invention, then it 
is required that the corresponding naturally occurring epitopes for that part of the gene 
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product are preserved. This applies equally to CD8+ T cell epitopes and to T helper 
epitopes as appropriate. In general, and as set out above, longer fragments of the 
individual genes are preferred and most preferred are fullest length polypeptides as 
described herein and as set forth in the sequence listing. Clearly the same principles 
5 extend to the consideration of sequence identity, that is to say that when assessing 
sequence identity with regard to a fragment according to the present invention, first the 
fragment length is chosen and then the sequence comparison is made across the 
corresponding fragment length section of the natural or reference sequence. The 
invention requires that all epitopes present on that fragment length are preserved. 
10 Preferably full length g/p/n genes are considered as described herein. 

In one embodiment, the invention relates to gag/pol/nef polypeptides comprising 
strings of the naturally occurring epitopes, with only minimal connecting sequences. 

15 Unnatural amino acids or analogues thereof or derivatised moieties may feature in the 
polypeptides of the present invention. 

Relative sequence identity may be determined by computer programs which can calculate 
the percentage identity between two or more sequences using any suitable algorithm for 

20 determining identity, using for example default parameters. A typical example of such 
a computer program, is CLUSTAL (see Thompson et al, 1994 (NAR 22:4673-80) or 
http://ww.psc.edu/general/so Alternatively, the 

BLAST algorithm is employed, with parameters set to default values. The BLAST 
algorithm is described in detail at http://www.ncbi.nih.gov/BLAS^last_help.html, 

25 which is incorporated herein by reference. 

Other computer programs used to determine identity and/or similarity between 
sequences include but are not limited to the GCG program package (Devereux et al 
1984 Nucleic Acids Research 12:387), FASTA (Atschul et al 1990 J Mol Biol 403- 
30 410) and the GENEWORKS suite of comparison tools. 
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FASTA uses the method of Pearson and Lipman (Proc. Natl. Acad. Sci. USA 85; 
2444-2448 (1988)) to search for similarities between one sequence (the query) and any 
group of sequences. FASTA uses the following search parameters: these can be 
advantageously set to the defined default parameters: Matrix: as for BLAST (not used 
5 by FASTA for nucleotide comparisons). Wordsize - the number of continuous 
residues or bases which are considered at once in the initial comparison; default is 6 
for nucleotide sequences, 2 for amino acid sequences. Gap penalty: This is the number 
of points deducted from a similarity score when a new gap is created; default is 16 for 
nucleotide sequences, 12 for amino acid sequences. Gap extension penalty: This is the 

10 number of points deducted from a similarity score when an existing gap is enlarged; 
default is 4 for nucleotide sequences, 2 for amino acid sequences. Expect: this restricts 
the number of sequences returned according to statistical significance; default is 2, 
List: this restricts the number of homologous sequences which are reported; default is 
40. Align: this restricts the number of homologous sequences for which alignments 

1 5 are displayed; default is 1 0. 

FASTA is available via Biology WorkBench at the University of Illinois 
(http://biologv.ncsa.uiuc.edu/) , or from the Genetics Computer Group (GCG). 

20 BLAST (Basic Local Alignment Search Tool) is a heuristic search algorithm employed 
by the programs blastp, blastn, blastx, tblastn, and tblastx; these programs ascribe 
significance to their findings using the statistical methods of Karlin and Altschul (see 
http://www.ncbi.nih.gov/BLAST/blast_help.html) with a few enhancements. The 
BLAST programs were tailored for sequence similarity searching, for example to 

25 identify homologues to a query sequence. For a discussion of basic issues in similarity 
searching of sequence databases, see Altschul et al (1994:Nature Genetics 6:119-29). 

BLAST uses the following search parameters: these can be advantageously set to the 
defined default parameters: HISTOGRAM - Displays a histogram of scores for each 
30 search; default is yes. DESCRIPTIONS - Restricts the number of descriptions of 
homologous sequences reported; default is 100. EXPECT - The statistical significance 
threshold for matches between sequences, according to the stochastic model of Karlin 
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and Altschul (1990: PNAS 87:2264-8); default is 10. ALIGNMENTS - Restricts the 
number of sequences for wliich alignments are displayed; default is 50. MATRIX - 
Specifies a scoring matrix For BLASTP, BLASTX, TBLASTN and TBLASTX. The 
default matrix is BLOSU1V162 (Henikoff & Henikoff 1992:PNAS 89:10915-9). 
5 STRAND - Restrict a searcli to one or other strands of the sequence, (if a nucleotide 
sequence); default is both strands. FILTER - Masks off segments of the query 
sequence which have low complexity, as determined by the SEG program of Wootton 
& Federhen (1993: Computers in Chemistry 17:149-163), or segments consisting of 
short-periodicity internal repeats, as determined by the XNU program of Claverie & 
10 States (1993: Computers azid Chemistry 17:191-201) or by the DUST program of 
Tatusov and Lipman (see http://www.ncbi.nlm.nih.gov); default filtering is DUST for 
BLASTN, SEG for other programs. 

Most preferably, sequence comparisons are conducted using the FASTA alignment 
15 tool. 

Although in general the sequence comparison techniques mentioned herein are well 
known in the art, reference may be made in particular to Sambrook et al., Molecular 
Cloning, A Laboratory Manual (1989) and Ausubel et al, Short Protocols in 
20 Molecular Biology (1999) 4 th Ed, John Wiley & Sons, Inc. 

The present invention embraces sequences possessing significant identity to the 
exemplary sequences disclosed herein. Preferably a sequence has at least 40% identity 
to a sequence disclosed herein, preferably at least 45% identity, preferably at least 50% 

25 identity, preferably at least 55% identity, preferably at least 60% identity, preferably at 
least 65% identity, preferably at least 70% identity, preferably at least 75% identity, 
preferably at least 76% identity, preferably at least 78% identity, preferably at least 
80% identity, preferably at least 85% identity, preferably at least 90% identity, 
preferably at least 91% identity, preferably at least 92% identity, preferably at least 

30 93% identity, preferably at least 94% identity, preferably at least 95% identity, more 
preferably at least 96% identity, preferably at least 97% identity, preferably at least 
98% identity, preferably at least 99% identity, or even more. 
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Clearly for amino acid sequences of polypeptides according to the present invention, 
the requirement for preservation of epitopes means that the sequence identity may be 
concentrated in areas of known epitopes where the sequence identity will preferably be 

5 100% within the actual epitope sequence(s). Therefore, the regions of inter-epitope 
amino acid sequence may vary to a greater degree than the actual epitope regions, 
which preferably remain constant with respect to the reference sequence. Preferably 
the percentage identity figure is judged across the whole corresponding lengths of the 
relevant sequences being compared, including the epitope and inter-epitope regions. 

10 In another embodiment, the identity figures can relate to the variable regions only ie. 
the inter-epitope regions only, the epitopes being taken to be 100% conserved in this 
embodiment. 

OPTIMISATION OF GENE CONSTRUCTS 

15 

In a preferred embodiment, a 'Kozak' consensus sequence, GCC GCC ACC ATG G, 
is placed upstream of the gene construct(s). 

In a preferred embodiment, the codon usage of the sequence may be modified to 
20 optimize the efficiency of translation of the gag-pol-nef transcript in human cells. 

In a preferred embodiment, an Apa I and an Asc I restriction endonuclease recognition 
site are placed at the 5-prime and 3-prime ends of the gag-pol-nef (gpn) gene 
respectively. This advantageously facilitates subcloning of the gene cassette. In this 

25 embodiment, the restriction sites will be present in recombinant virus but are 
advantageously outside the open reading frame of the GAG-POL-NEF fusion protein. 
Consequently they will not have any influence on the amino acid sequence of the 
fusion protein nor the immune response generated to that protein. Apal and AscI were 
also chosen to advantageously allow direct cloning into the pOPK6 recombinant 

30 plasmid. They have the further advantage of exerting no known influence on gene 
function. 
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In a preferred embodiment, a sequence encoding a reporter CD8+ T cell epitope, 
RGPGRAFVTI (SEQ ID NO:7), recognized by murine CD8-positive T cells specific 
for the gp 160 protein, may "be incorporated into a recombinant gene of the present 
invention. This has the advantage of allowing monitoring of the induction of CD8+ T 
5 cells following immunisation with GPN-containing vaccines. Advantageously the 
presence of the epitope is not known to affect the function of the fusion gene. In a 
highly preferred embodiment, this reporter epitope is absent from 
construct(s)/recombinant geae(s) for primate vaccination, such as human vaccination. 

10 In a preferred embodiment, an additional antibody tag, YPYDVPDYA (SEQ ID 
NO: 8), recognized by antibodies specific for this part of the influenza virus 
haemagglutinin protein, is added to the gene of the present invention (or to the nucleic 
acid encoding it). Preferably this is added to the carboxyterminus of the protein to 
allow the detection of expression in antibody-based immunoassays, such as 'western 

15 blot assays'. In a highly preferred embodiment, this tag is incorporated into the 
carboxyterminus of a recombinant gag-pol-nef gene, such as in a recombinant vector 
comprising a recombinant gag-pol-nef gene. In a highly preferred embodiment, this 
antibody tag is absent from construct(s)/recombinant gene(s) for primate vaccination, 
such as human vaccination. 

20 

PHARMACEUTICAL COMPOSITIONS 

The present invention also provides a pharmaceutical composition comprising 
administering a therapeutically effective amount of the agent of the present invention 
25 (such as a recombinant HIV gene such as the recombinant gpn gene as discussed 
herein) and a pharmaceutical^ acceptable carrier, diluent or excipients (including 
combinations thereof). 

The pharmaceutical composition may comprise two components - wherein a first 
30 component comprises a nucleic acid vector and a second component which comprises 
a viral vector thereof. The first and second component may be delivered sequentially, 
simultaneously or together, and even by different administration routes. 
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The pharmaceutical compositions may be for human or animal usage in human and 
veterinary medicine and will typically comprise any one or more of a pharmaceutically 
acceptable diluent, carrier, or excipient. Acceptable carriers or diluents for therapeutic 
5 use are well known in the pharmaceutical art, and are described, for example, in 
Remington's Pharmaceutical Sciences, Mack Publishing Co. (A. R. Gennaro edit. 
1985). The choice of pharmaceutical carrier, excipient or diluent can be selected with 
regard to the intended route of administration and standard pharmaceutical practice. 
The pharmaceutical compositions may comprise as - or in addition to - the carrier, 
10 excipient or diluent any suitable binder(s), lubricant(s), suspending agent(s), coating 
agent(s) or solubilising agent(s). 

Preservatives, stabilizers, dyes and even flavouring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium benzoate, 
15 sorbic acid and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents 
may be also used. 

There may be different composition/formulation requirements dependent on the 
different delivery systems. By way of example, the pharmaceutical composition of the 
20 present invention may be formulated to be delivered using a mini-pump or by a 
mucosal route, for example, as a nasal spray or aerosol for inhalation or ingestible 
solution, or parenterally in which the composition is formulated by an injectable form, 
for delivery, by, for example, an intravenous, intramuscular or subcutaneous route. 
Alternatively, the formulation may be designed to be delivered by both routes. 

25 

Where the agent is to be delivered mucosally through the gastrointestinal mucosa, it 
should be able to remain stable during transit though the gastrointestinal tract; for 
example, it should be resistant to proteolytic degradation, stable at acid pH and 
resistant to the detergent effects of bile. 

30 

Where appropriate, the pharmaceutical compositions can be administered by 
inhalation, in the form of a suppository or pessary, topically in the form of a lotion, 
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solution, cream, ointment or dusting powder, by use of a skin patch, orally in the form 
of tablets containing excipients such as starch or lactose, or in capsules or ovules either 
alone or in admixture with excipients, or in the form of elixirs, solutions or 
suspensions containing flavouring or colouring agents, or they can be injected 
5 parenterally, for example intravenously, intramuscularly or subcutaneously. For 
parenteral administration, the compositions may be best used in the form of a sterile 
aqueous solution which may contain other substances, for example enough salts or 
monosaccharides to make the solution isotonic with blood. For buccal or sublingual 
administration the compositions may be administered in the form of tablets or lozenges 
1 0 which can be formulated in a conventional manner. 

PHARMACEUTICAL COMBINATIONS 

The agent of the present invention may be administered with one or more other 
15 pharmaceutical^ active substances. By way of example, the present invention covers 
the simultaneous, or sequential treatments with an agent according to the present 
invention and one or more steroids, analgesics, antivirals or other pharmaceutical^ 
active substance(s). 

20 The invention also finds application in a therapeutic immunotherapy for people 
infected with HIV such as HIV-1. The invention may be used in combination with 
HAART, and/or in combination with a prophylactic immunotherapy for people at risk 
of infection with HIV such as HTV-l. 

25 It will be understood that these regimes include the administration of the substances 
sequentially, simultaneously or together. 

The present invention will now be described by way of example, which is not intended 
to limit the scope of the appended claims, in which reference will be made to the 
30 following figures: 



Brief Description of the Figures 
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Figure 1 shows preferred gpn gene constructs. The sequence is continuous and has 
only been separated here for clarity. Bold indicates sequence variation between GPN 
sequence and the HIV Molecular Immunology sequence (NB. not all such sequence 
5 variations are shown). 

Figure 2 shows scrambled and native nef gene sequences used for MHC Class II 
epitope comparisons. 

Figure 3 shows T helper epitopes for scrambled nef (fig 3 A) and native nef (fig 3B). 
Figure 4 shows a map of recombination plasmid, p29D.gpn, for the construction of a 
1 0 recombinant fowlpox strain FP9 expressing gag-pol-nef. 

Figure 5 shows a map of recombination plasmid, pOPK6.gpn, for the construction of a 
recombinant MVA expressing gag-pol-nef. 

Figure 6 shows the sequence of GPN. The GPN sequence is shown in normal text. The 

upstream region is shown in italics. 5-prime Apal and 3 -prime AscI sites are 
1 5 underlined. Initiating ATG and terminating TGA are shown in bold. 

Figure 7 shows a bar chart of an IFN-y ELISpot as described in Example 3. 

Figure 8 shows a bar chart of an IFN-y ELISpot as described in Example 5. 

Figure 9 shows overlapping 20mer peptides used in an IFN-y ELISpot assay. Amino 

acid length shown in brackets. 
20 Figure 10 shows a bar chart of GAG-specific responses in ex.4 group 4 animals. 

Figure 1 1 shows a bar chart of POL-specific responses in ex.4 group 4 animals. 

Figure 12 shows a bar chart of NEF-specific responses in ex.4 group 4 animals. 

Figure 13 shows bar charts of IFN-y responses. 

Figure 14 shows a bar chart of IFN-y responses elicited against selected peptides from 
25 gpn-sequence. 

EXAMPLES 

Example 1: Analysis of CD8+ T cell and T helper epitopes in the GPN sequence. 

30 

We used ProPred to compare T helper epitopes for native NEF versus scrambled NEF. 
This program predicts that there are more T helper epitopes in scrambled NEF 
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according to the present invention. This demonstrates that the scrambled NEF 
sequence is at least as potent in eliciting T helper and CD8+ T cell responses as native 
NEF. 

5 CD8+ T cell Epitopes 

CD8+ T cells can recognise and elitninate virally infected T cells and have been 
associated with control of viraemia in HIV infection in man and SIV infection in 
monkeys. 

10 Comparison of the GPN sequence (Figure 1) with the HXB2 reference sequence 
published in the HIV molecular immunology database ("HIV Molecular Immunology 
2002: Maps of CD8+ T cell Epitope Locations Plotted by Protein" Theoretical Biology 
and Biophysics, Los Alamos National Laboratory. August 7, 203; http://hiv- 
web.lanl.gov/content/immunologv/maps/ctl/ctl.pdf) is performed. 

15 

This comparison reveals that all of the known murine and human CD8+ T cell epitope 
sequences identified for native gag (pi 7, p24), truncated-pol (p51) and nef are present 
in the GPN polypeptide sequence. There are single amino acid differences between the 
GPN sequence and the HXB2 reference sequence. Epitope sequences are indicated on 
20 the protein sequences of HXB2 and provide a relative location of the defined epitopes 
although they may vary relative to the protein sequence from which they were defined 
(refer to HIV Molecular Immunology Database 2002: Section II-A-2 - HIV Protein 
Epitope Maps, p56; http://hiv-web.lanl.gov/content/immunology /pd^2002/ 
immuno2002.pdf). 

25 

Therefore, shuffling and rearrangements within the GPN sequence have not altered per 
se the capacity of this molecule to induce CD8+ T cell responses in mice, monkeys or 
man against the native gag (pl7, p24), truncated-pol (p51) and nef sequences present 
in HIV. 

30 

Scrambling and rearrangement of the native polypeptides constituting the GPN protein 
may have created new CD8+ T cell epitopes unique to this molecule. Since these 
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epitopes are unlikely to be present in native gag, pol or nef sequences within the HIV 
virus, they are therefore unlikely to be of biological relevance for a novel antigen for 
inclusion in a vaccine against HIV. Consequently, these epitopes have not been 
investigated further in this example. 

5 

T helper epitopes 

T helper immune responses enhance the CD8+ T cell effector response in terms of 
magnitude and breadth and therefore may enhance the efficacy of the CD8+ T cell 
response against HIV infection. 

10 

Comparison of the GPN sequence with the HXB2 reference sequence published in the 
HIV molecular database ("HIV Molecular Immunology 2002: Maps of T helper 
Epitope Locations Plotted by Protein" Theoretical Biology and Biophysics, Los 
Alamos National Laboratory. August 7, 203; http://hiv- 

15 web.lard.gov/content/immunology/maps/lielDer/helper.pdf) reveals that all of the 
human T helper epitope sequences identified for native gag (pi 7, p24), truncated-pol 
(p51) and nef are present in the GPN polypeptide sequence. Therefore, shuffling and 
rearrangements within the GPN sequence have not reduced the capacity per se of this 
molecule to induce T helper responses against the native gag (pl7, p24), truncated-pol 

20 (p5 1) and nef sequences present in the clade B virus in humans. 

Since T helper epitopes are usually longer than 15 amino acids, fusion of the gag, 
truncated-pol and nef sequences in GPN is likely to have advantageously created novel 
epitopes bridging the points at which these genes or parts of genes have been fused. 

25 Moreover, shuffling of the nef gene is likely to have created a number of novel 
epitopes bridging the shuffled sections of resulting polypeptide. Such novel T helper 
epitopes advantageously may elicit immune responses when GPN is administered as a 
vaccine, thereby enhancing the breadth and strength of the biologically-relevant CD8+ 
T cell responses elicited by the polypeptide(s) of the present invention relative to the 

30 native GAG, truncated-POL and NEF proteins. 
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To demonstrate that shuffling had created potential new T helper epitopes, the native 
NEF and scrambled NEF polypeptide sequences (Figure 2) are compared using the 
Propred MHC Class II epitope prediction program, employing a matrix based 
prediction algorithm as disclosed in Sturniolo et al. Nat. Biotechnol. 17. 555- 
5 561(1999) and Singh and Raghava (2001) Bioinformatics,17(12), 1236-37, such as 
may be found at fhttp://www.imtech.res.in/raghava/t>ropred/ )> 

Comparison of these molecules reveals substantial changes in the predicted human 
MHC class II restricted T helper epitopes, with a greater number of predicted epitopes 
10 in the scrambled NEF sequence than the native NEF sequence (Figures 3 A and 3B). 

Thus, scrambling of the nef sequence has resulted in the creation of additional T helper 
epitopes for enhancement of the CD8+ T cell response against this polypeptide 
according to the present invention. These advantageous novel T helper epitopes 
15 comprise those epitopes shown in Figure 3A and which are absent from Figure 3B. 

Similar novel T helper epitopes will have been generated across the fusion points 
between the proteins comprising the GPN polypeptide. Comparison as described 
above may be used to define/investigate these further, 

20 

Example 2: Recombinant GPN gene in DNA, MVA and Fowlpox 

A recombinant gag-pol-nef (gpn) gene is constructed for expression of a GPN fusion 
protein. This gene construct is used in nucleic acid carrier such as DNA carrier (eg. 
25 plasmid carrier), as well as in MVA and fowlpox vectors and materials for 
construction of same. 

The use of recombinant virus vectors such as MVA and/or fowlpox vectors to boost a 
DNA plasmid-mediated prime has been shown to induce a strong immune response in 
30 rodents. 



Essentially the same constructs also find application in primates such as humans. 
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In this example, the therapeutic antigen delivered by the recombinant pox viruses 
comprises a fusion protein based on the products of the products of the HIV-1 clade-B 
gag, pol and nef genes. 

5 

The recombinant gag-pol-nef gene was synthesized as a series of overlapping 
oligonucleotides, amplified by use of polymerase chain reaction cloned into the 
commercially available plasmid, pUC19, to make pUC19.gpn, (or 02-213), and the 
nucleotide sequence determined. 

10 

The gag-pol-nef gene is subcloned into three plasmid vectors to make 

i) a plasmid expression plasmid, pSG2.gpn for use as a priming agent for in vivo 
use such as preclinical testing of gpn-containing recombinant poxvirus vectors; 

ii) a recombination plasmid, pOPK6.gpn, for the construction of a recombinant 
1 5 MVA expressing gag-pol-nef; 

iii) a recombination plasmid, p29D.gpn, for the construction of a recombinant 
fowlpox strain FP9 expressing gag-pol-nef. 

The expression plasmid, pSG2.Mel3, (Palmowski, MJ. et al 2002 J Immunol 168 
20 p439 1-4398) contains an expression cassette based on the human cytomegalovirus 
immediate-early promoter and intron combined with the bovine growth hormone gene 
polyadenylation signal to express a synthetic epitope string containing melanoma 
antigens. The expression plasmid pSG2.gpn is made by digesting pUC19.gpn with 
Apal and AscI, blunt-ending the insert, and subcloning into blunted Pstl-digested 
25 pSG2Mel3. 

The MVA recombination plasmid, pOPK6 is made as follows. The plasmid is based on 
the commercially available cloning plasmid, pSP72, (Promega Inc.); the standard 
30 multiple cloning site being replaced with a synthetic linker containing unique 
restriction sites and the vaccinia virus P7.5 late-early promoter and late Pll promoter 
arranged in a head-to-head orientation. The linker also has the first twenty nucleotides 
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of the Escherichia coli LacZ, (beta-galactosidase), gene as found in the vaccinia virus 
recombination plasmid pSCll, (Chakrabarti et al 1985 Mol Cell Biol 5 p3403-3409). 
The synthetic linker is synthesized as a series of overlapping oligonucleotides, 
amplified by PCR, and cloned into the commercial cloning plasmid, pcDNA3.1, to 
5 make pLink, (or 02-363). The sequence of the 300 base pair linker is given below: 

agatcmaattaatgcctaggcaattgagacggcgcgcccgggcactagtaagggcccgtgcaataaattagaatatattttc 
tactmaccagaaattaattgtacaamattatttetgggtgaaaaacttactataaaaagcgggtgg^ 
catgcatcttagaatatatgtatgtaaaaatatagtagaam 
1 0 ctcctgcaggcatgctaagctagcggccggccctcgag 

Part of the LacZ gene from pSCl 1 is subcloned into pLINK to make pLinkLacZ. The 
linker and LacZ gene were then subcloned into pSP72 as a Xhol-Bgin fragment such 
that a complete beta-galactosidase-encoding open reading frame is formed 3-prime to 
15 the P 1 1 late promoter to make pSP72LinkLacZ. 

MVA flanking regions representing 1500 base pairs 5-prime and 3-prime to the EcoRI 
site in the MVA thymidine kinase (tk) gene were isolated from MVA by PCR using 
the following primer-pairs: 

20 

Left-hand, (5-prime) tk flank 

Forward primer CAATTACAGATTTCTCCGTGrATAGGT 
25 Reverse primer CGTGCATGCGGCCGCAACAATGTCTGGAAAGAACTG 

Right-hand, (3-prime) tk flank 
30 Forward primer CAGGAATTCGCGGCCGCTGTGAGCGTATGGCAAACG 



Reverse primer TCATTTGCACTTTCTGGTTCGTA 
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The 1500bp PCR products are cloned into the commercial cloning vector pTOPO-TA, 
(Invitrogen) and sequenced. 

5 

The right-hand flank is subcloned into pSP72LinkLacZ as an EcoRI-Mfel fragment to 
make p72LinkLacZR. The left-hand flank insert is then subcloned into p72LinkLacZR 
as an Nhel-SphI fragment to make pOPK6, 

10 Subsequent sequence analysis of this plasmid identified an additional 174 base pairs of 
sequence in the pOPK6 backbone plasmid introduced from the pTOPO-TA vector with 
the left-hand flank due to non-specific cleavage by the Nhel enzyme. 

The gag-pol-nef open reading frame was subcloned from pUC19.gpn into pOPK6 as 
15 an Apal-AscI fragment to make pOPK6.gpn and the gpn insert sequenced to confirm 
no alterations had occurred during the subcloning steps. 

This plasmid functions as a vehicle for introducing the gpn open reading frame, under 
control of the P7.5 late-early promoter, into the tk locus of MVA. 

20 

FP9 vector 

The FP9 strain fowlpox recombination plasmid, pEFL29, was obtained from Dr Mike 
Skinner at the Institute for Animal Health, Compton, UK as published by Qingzhong, 
25 Y. et al 1994, Vaccine 12(6) p569-573. 

The gpn open reading frame was subcloned from pUC19.gpn as a blunted KpnI-SacI 
fragment into the Smal site of pEFL29 to make pEFL29.gpn. Sequencing of this 
plasmid showed that LacY and part of LacA had been introduced with the LacZ 
30 marker gene during the construction of pEFL29. Consequently, a linker was 
synthesized that allowed the deletion of all of the LacY and most of the remaining 
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LacA open reading frames by means of a BsrGI-BlpI digest of pEFL29 to make 
p29Delta. The sequence of the linker is given below. 

TGAGCGCCGGTAGATACCATTATCAGCTGGTGTGGTGTCAGAAGTAATGTA 
5 C 

The gpn expression cassette was then subcloned from pEFL29.gpn as an Aatll-Sphl 
fragment into Aatll-Sphl cut p29Delta to make p29D.gpn. The gpn insert was then 
sequenced to confirm no alterations had occurred during the subcloning steps. 

10 

The recombination plasmids, pOPK6.gpn and p29D.gpn are used to introduce the 
P7.5-gpn expression cassette into MVA and FP9, respectively. 

The recombinant poxviruses were made by homologous recombination of the virus 
15 genome with the relevant recombination plasmid in chicken, embryo fibroblast, (CEF), 
cultures using standard molecular biology techniques as described in Current Protocols 
in Molecular Biology, Ed. F.M. Ausubel, John Wiley <& Sons; or according to 
suppliers' or manufacturers' instructions. 

20 MVAgpn is made by infecting confluent cultures of CEF cells with a non- 
recombinant MVA, (stock 575FHE-K), derived from MVA passage number 575, 
(Mayr et al. 9 (1978) Zentralbl Bakteriol [b]. 167(5~6):375-90) at a multiplicity of 
infection of 0.1 for one hour. The infected cells are then transfected with between 0.5 
and 2.0^g pOPK6.gpn plasmid DNA mixed with lipofectia (Invitrogen), according to 

25 the manufacturer's protocol. The cultures are incubated for three days before 
harvesting. The harvested cells are freeze-thawed three tiiraes and titrated as ten-fold 
dilutions and cultured under CMC-containing medium for three days. The medium is 
removed and replaced with CMC-medium containing X-Gal and the cultures incubated 
in this medium for 20 hours. 

30 

Recombinant viruses are identified by a blue plaque phexiotype and picked with a 
pipette into cryotubes containing lOOjal medium. The picked viruses are plaque 
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purified five more times to ensure purity. Purity from non-recombinant parental virus 
was confirmed by the absence of white plaque phenotype and the absence of a WT 
PCR product when the virus DNA is screened using appropriate primers. 

5 Purity PCR screening was performed on virus DNA purified from 2ml cultures 
infected with fifth or sixth round clones of MVA.gpn. 

The virus DNA is purified using a Qiagen Blood and Cell Culture DNA min-kit 
(Qiagen GmbH, Max Volmar str. 4, 40724 Hilden, Germany) according to the 
10 manufacturer's protocol. A PCR reaction was performed using Klentaq proof reading 
polymerase (Sigma) using appropriate incubation conditions. 

Parental virus was screened for using the following primers to give a band of 471 base 
pairs in the wild-type, but nothing in MVA; 

15 

TKU CAATTACAGATTTCTCCGTGATAGGT 

TKL TCATTTGCACTTTCTGGTTCGTA 

20 MVA.gpn was screened for using primer TKL and the following primer to give a band 
of approximately 2000 base pairs from the MVA.gpn construct: 

HIV-U ATACCCCCGTGTTCGCCATTAAGA 

25 Expression of the GPN protein from MVA.gpn is confirmed by immunoblotting as 
described below. 

CEF cells are infected with MVA.gpn or MVA.LacZ control virus at a multiplicity of 
infection of 1. The culture is incubated for 3 days before harvesting. The cells are 
30 heated to 100 degrees C in SDS sample buffer and electrophoresed through a 4-20% 
acrylamide gel and electroblotted onto a nitrocellulose membrane. The transferred 
protein is then probed with either an antibody specific for the haemagglutinin tag 
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(Abeam), or one specific for HIV GAG P24, (Dako) and visualized using a suitable 
peroxidase-labelled secondary antibody and a chromogen. The blot assay indicates that 
the cells infected with MVA.gpn contain a protein of approximately 134kDa 
containing P24 and HA epitopes that was absent in the cells infected with MVA.LacZ. 
5 The predicted molecular weight of the GPN protein is 128.5 kDa, which is in close 
agreement with the specific band produced in the MVA.gpn-infected cells. 

Expression of the GPN protein from FP9.gpn is confirmed by immunoblotting. CEF 
cells are infected with FP9.gpn or FP9.LacZ, ('FP9.29D'), control virus at a 

10 multiplicity of infection of 1. The culture is incubated for 5 days before harvesting. 
The cells are heated to 100 degrees C in SDS sample buffer and electrophoresed 
through a 4-20% acrylamide gel and electroblotted onto a nitrocellulose membrane. 
The transferred protein is then probed with either an antibody specific for the 
haemagglutinin tag (Abeam), or one specific for HIV GAG P24, (Dako) and visualized 

15 using a suitable alkaline phosphatase-labelled secondary antibody and a chromogen. 
The blot assay indicated that the cells infected with FP9.gpn contained a protein of 
approximately 134kDa containing P24 and HA epitopes that is absent in the cells 
infected with FP9.29D. The predicted molecular weight of the GPN protein is 128.5 
kDa, which is in close agreement with the specific band produced in the FP9.gpn- 

20 infected cells. 

The immunogenicity of the MVA.gpn and FP9.gpn viruses was tested in BALB/c mice 
using a 'prime-boost' protocol. The mice were divided into groups of four animals. 
The following test and control groups were established: 

25 

pSG2.gpn (50n) prime followed 14 days later by a boost lxlO 6 p.f.u. of MVA.gpn. 
pSG2.gpn (50|a) prime followed 14 days later by a boost lxlO 6 p.f.u. of FP9.gpn. 
30 lxlO 6 p.f.u. of MVA.gpn alone. 
lxlO 6 p.f.u. of FP9.gpn alone. 
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7 days after virus infection the peripheral blood lymphocytes were harvested and the 
number of GPN-specific interferon gamma-secreting lymphocytes determined in an 
ELISpot assay using the KQV following T cell epitopes (see also example 3): 

5 

H2-D CD8 epitopes: AMQMLKETI 

TTSTLQEQ 
Gpl 60 H2-D CD8 epitope: RGPGRAFVTI 
H2-D CD4 epitope: NPPIPVGEIYKRWIILGLNK 

10 

Both MVA.gpn and FP9.gpn are shown to induce strong CD8+ T cell responses in the 
mice. 

The prime-boost protocol is shown to give greater response than that seen in the 
1 5 groups receiving a single inj ection with virus. 

Example 3: Immunisation with FP9.gpn and MVA.gpn elicits antigen specific 
CD8+ T cell responses 

20 In this example it is demonstrated that FP9.gpn and MVA.gpn elicit enhanced antigen- 
specific CD8+ T cell responses when administered alone or in a prime-boost 
immunisation regime with pSG2.gpn. In this example, the demonstration is presented 
in mice. 

25 Female BALB/c mice (6-8 weeks old) are immunised with 50jig of pSG2.gpn by 
intramuscular (im.) injection and boosted with 1 X 10 6 PFU FP9.gpn or 1 X 10 5 
MVA.gpn by intravenous injection (iv.) two weeks later. A further group of age 
matched naive female BALB/c mice are immunised iv. with 1 X 10 6 PFU FP9.gpn or 1 
X 10 5 MVA.gpn at the same time as the booster immunisation. 

30 

Fourteen days after the booster immunisation, all mice are sacrificed by cervical 
dislocation and the T cell response elicited against three H-2 d restricted CD8+ epitopes 
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from the GPN polypeptide (Table A: AMQ, TTS, RGP) is deteixnined by IFN-y 
ELISpot assay as described below. 

Murine IFNy ELISpot protocol: 

5 

Materials: 

IFN-y ELISpot ALP Kit Mabtech 3321-2A 
600jxg anti-IFN-y purified Mab AN18 
10 50|xg anti-IFN-y biotinylated Mab R46A2 
50(Ltl Streptavidin-Alkaline Phosphatase 

Complete a-MEM medium 
500ml MEM a-modification Sigma M-4526 
15 50ml FCS [10%] Sigma F-2442 

5ml pen/strep [100U penicillin 100ng strep] Sigma P-0781 

10ml L-glutamine [4mM] Sigma G-7513 

500|il 2-Mercaptoethanol [50^tm] Gibco BRL 31350-010 

20 ACK buffer 

8.29gNH 4 Cl [0.15M] (Sigma A-4514) 

lgKHC0 3 [lmM] (Sigma P-9144) 
37.2mg Na 2 EDTA (Sigma ED2SS) 
800ml milli-Q water 
25 Adjust pH to 7.2-7.4 with HC1 (Sigma S-7653) 
Make up to 1000 ml with water and autoclave 

Colour Development Buffer: 

BioRad AP Conjugate Substrate kit (170-6432 ). 

30 For one plate: 

5ml deionised water 
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200jli1 of25x buffer 
50fxl reagent A 
50|il reagent B 

Mix well and use immediately 

5 

Protocol 

1 . Preparation of Plates : 

1.1. Coating plates: coat MAP multiscreen plates (Millipore MAIPS4510) with 
1 0 rat anti-mouse IFNy (Mab AN1 8) antibody. Dilute to 1 Oyg/ml in Phosphate 

Buffered Saline (PBS; Sigma P-3813) and add 50^1 per well to MAIP plates. 
Incubate overnight at 4°C in a humidified chamber 

1.2. Blocking plates: Flick off coating antibody and wash plates once witii 150 ul 
of sterile PBS (Sigma P-3813) per well using a multi-channel pipette. Flick 

15 off the PBS, add lOOul complete a-MEM medium per well, and incubate at 

room temperature for 1+ hour. It is important to keep the plates steriLe at this 
stage. 

Splenocyte preparation: 

2.1. Crush individual spleens in 2 ml of PRS with the plunger of a 10 ml syringe in 
a 70 (am cell strainer (Falcon 352350) contained in a petri dish, add 5 ml of 
PBS, suspend splenocytes by pipetting, and transfer into a 50 ml tube. Rinse 
cell strainer and dish with a further lO ml of PBS and add to the 50 mil tube. 
Centrifuge at 1500 rpm for 5 min. 

2.2. Remove supernatant, re-suspend cells by tapping tube and add 5ml ACK buffer 
and mix by inversion. Incubate at room temperature for no longer than 5 
minutes. Add 25 ml PBS, mix by inversion and centrifuge at 400 X g for 5 

30 min. 
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2.3.Remove supernatant re-suspend pellet by tapping the tube, add 10ml PBS and 
vortex. Count using an improved Neubauer haemacytometer by diluting 1:10 
in 0.4% trypan blue solution (Sigma T-8154). Aliquot amount needed for the 
Elispot and centrifuge at 1500 rpm for 5 min, resuspend by vortexing in an 
5 appropriate volume of complete Alpha MEM medium to give a concentration 

of 10 million cells/ml. 

3. Plate setup: 

10 50jal 50^1 50|al 

< < > ->-> < ► -» -> 

1 2 3 4 5 6 7 8 9 10 11 12 
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15 Note: Plate layout should be varied according to needs. This layout is convenient for 
this example. 

3.1.Flick blocking media from plate and add 50jxl of complete alpha MEM 
medium to columns 3, 4, 7, 8, 1 1&12. 
20 3. 2. Add 150^1 of splenocytes to columns 2, 6 and 10 in duplicate. (Up to 12 
samples per plate) 

3.3.Take 50(il of splenocytes from columns 2, 6 and 10 and transfer to columns 1, 
5 and 9 respectively: these are the negative control wells. 
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3.4.Serially dilute each sample by taking 50|xl from columns 2, 6 and lO, to 

columns 3, 7 and 1 1, mix well and transfer 50jal to 4, 9 and 12. Discard 50^1 

after mixing final columns in dilution. 
3.5.Add test peptide and control peptide to twice the desired final concentration to 
5 naive splenocytes at 10 million/ml in complete oc-MEM medium. Add 50|j,l of 

control peptide and targeT cells to columns 1, 5 and 9. Add 50\x\ test peptide 

and targeT cells to remaining columns. 
3.6.Incubate plates at 37°C for 18-20 hours. 

10 4. Developing the Assay 

4.1. Wash plates twice with PBS containing 0.05% Tween 20 (Sigma PI 379), once 
with distilled water and twice with PBST. 

4.2. Add 50^1/well of biotinylated rat anti-mouse interferon-gamma diluted to 
1 5 1 p,g/ml in PBS. Incubate for 2 hours at room temperature. 

4.3. Wash plates four times with PBST, then add 50 |il Streptavidin Alkaline 
Phosphatase (Mabtech) diluted to l|ig/ml in PBS. Incubate at room 
temperature for 1 hour. 

4.4. Wash plates four times with PBST, add 50(al/well of colour development buffer 
20 Incubate at room temperature until spots develop (approx. 10 min). Wash 

plates well with tap water, peel off plastic bottom and leave to dry overnigh-t on 
paper towels. 

Calculation 

25 

Results are calculated as the number of epitope-specific IFNy spot forming 
cells/million splenocytes (sfc/million). Differences between groups are determined by 
one-way ANOVA and a post hoc Tukey-Karamer multiple comparison test on logio 
transformed data using GraphPad Instat version 3.05. 

30 

Results 
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The demonstration that immunisation with FP9.gpn and MVA.gpn alone or in prime- 
boost regimes with pSG2.gpn elicits antigen specific CD8+ T cell responses in mice is 
illustrated in figure 7. Female BALB/c mice were immunised im. with pSG2.gpn 
5 (DNA) or sham immunised with PBS ( - ) and boosted 14 days later with FP9.gpn 
(FP9) or MVA.gpn (MVA) as described above. The CD8+ T cell response was 
determined in splenocytes 14 days after the booster immunisation using the IFN-y 
ELISpot assay. Columns represent the mean IFN-y spot forming cells/million 
splenocytes ± standard deviation for 4 mice per group elicited by CD8+ reactive 
1 0 epitopes AMQ, ITS and RGP (see Table A). 

Table A: GPN epitopes used in this study 
( l ND = not determined) 



Antigen 


Abbr. 


Epitope 


CD4/CD8 
reactive 


MHC 
restriction 


HIV-1 
env 


RGP 


RGPGRAFVTI 


CDS 


D a 


HIV-1 
gag 


AMQ 


AMQMLKETI 


CD8 


K a 


HIV-1 
gag 


TTS 


TTSTLQEQ 


CD8 


H-2" 


HIV-1 
gag 


NPP 


NPPEPVGEIYKRWULG 
LNK 


CD4 


H-2 d 



15 

Single or prime-boost immunisation with either FP9.gpn or MVA.gpn elicited a 
significantly (P<0.01) enhanced antigen-specific T cell response against each of the 
CD8+ T cell epitopes when compared to sham- immunised controls (Figure 7). 

20 
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In addition, priming with pSG2.gpn and boosting with FP9.gpn or MVA.gpn elicited 
significantly (P<0.01) enhanced responses against each CD8+ T cell epitope when 
compared to single immunisations with each virus (Figure 7). 

5 It should be noted that the virus titres were not identical in this experiment. Thas, the 
relative immune potency of the viruses has not been compared. Naturally it is 
straightforward to assess this by performing the example with identical viral titres . 

Thus it is demonstrated that specific CD8+ T cell responses are elicited against the 
10 recombinant HIV genes according to the present invention such as the gpn gene by 
immunisation with either FP9.gpn or MVA.gpn, indicating that these constructs are 
both potent in eliciting an immune response against the GPN polypeptide. 

Moreover, potent immune responses are directed at epitopes lying in the N- and C- 
15 terminal regions of the GPN polypeptide, indicating that the whole polypeptide is 
expressed, processed and presented after delivery with either FP9.gpn or MVA.gpn. 

Both viruses elicit significantly higher immune responses against each epitope when 
administered to animals that have been primed with pSG2.gpn, indicating that both can 
20 act as boosting agents in prime boost immunisation regimes. 

Example 4: Immunogenicity of FP9.gpn and MVA.gpn in primates 

The immunogenicity of the gpn immunogens is demonstrated in vivo in primates as 
25 follows. 

The immunogenicity of the GPN polypeptide consisting of the gag, pol and nef 
proteins of clade B HIV-1 is tested in non-human primates (Macaca mulatta). 

30 The polyprotein (human codon usage) is expressed in recombinant MVA, 
fowlpoxvirus FP9, adeno virus and a DNA vaccine vector. The polyprotein expressing 
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constructs are administered in a heterologous immunisation regimen in order to induce 
high levels of antigen-specific CD8+ and CD4+ T cells. 

Table B shows the study design of the immunogenicity studies. 

5 

Table B 



Group No 
n=5 


Prime 1 


Prime 2 


Boost 1 


Boost 2 




DayO 


Day 28 


Day 56 


Day 84 


1 


FP9.gpn 


FP9.gpn 


MVA.gpn 


MVA.gpn 


2 


MVA.gpn 


MVA.gpn 


FP9.gpn 


FP9.gpn 


3 




FP9.gpn 


MVAgpn 




4 


pDNA+IL2- 
Ig/fc 


MVAgpn 


FP9.gpn 


Adeno.gpn 



The design of the macaque study of this example is shown in Table B. 

10 

Blood samples are taken pre-immunisation and 7 days after each immunisation and 
four weeks after the last immunisation. PBMCs were cryopreserved and tested in 
ELISpot and intracellular cytokine assays using pools of overlapping peptides. 

15 Cellular immune responses are tested during and at the end of the study. The 
frequency of IFN-y-secreting T cells is tested by IFN-y ELISpot assays (see Figure 9 
for overlapping peptides used). 

In an additional group, for comparitive purposes, DNA vaccines adjuvanted with EL- 
20 2/Ig fusion proteins (Barouch, D. H., A. Craiu, et al. (2000) Proc Natl Acad Sci XJ S A 
97(8): 4192-7) and recombinant adenovirus are tested. For study design see Table B. 

There are several parameters that are presented in this study: 
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(a) Order of FP9 and MVA. The most effective combinations of poxvirus vector 
primes and boosts are demonstrated. This particularly focuses on groups 1 and 2 of the 
study design (see Table B). 

5 

(b) Frequency of primes and boosts. The effectiveness (in terms of immunogenicity) of 
priming with a single poxvirus immunisation and boosting with a single heterologous 
poxvirus vector boost is demonstrated. This particularly focuses on animals in group 3 
which will be immunised as outlined. 

10 

Read-out: The key objective of this vaccination strategy is to induce cellular immune 
responses. Therefore the following assays are used to monitor cellular immune 
responses: IFN-y ELISpot using overlapping peptides (20mers overlapping by 10 
amino acids as shown in Fig 9), intracellular cytokine staining for CD8+ and CD4+ T 
15 cells. In ELISpot assays CD4/8 depletion experiments will confirm IFN-y secretion in 
response to peptides by CD4+ and/or CD8+ T cells. 

Data for group 4 is presented below. The data shows immunogenicity of HIV 
immunotherapeutics in rhesus macaques using novel prime-boost immunisation 
20 regimens. 

Table 1 Immunisation and sampling schedule 



Group 


d-28 


dO 


d2 | 


d7 


d28 


d35 


d56 


d63 


4 


Bleed 


Bleed; 


i.m. inj. 


Bleed 


Bleed; 


Bleed 


Bleed; 


Bleed 






i.m. inj. 


of 5 mg 




i.d. inj. 




i.d. inj. 








of 5 mg 


pfu 




of 5xl0 8 




of5xl0 8 








pfu 


pS-IL- 




pfu of 




pfu of 








pSG2. 


2fc 




FP9. 




MVA. 








gpn 






gpn 




gpn 





25 
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Results 

The GPN insert is immunogenic and induces interferon-gamma secreting T cells 
following viral vector immunisation in macaque monkeys. 

5 

In group 4 gpn-specific T cell responses are detected in 4/5 animals seven days after 
the FP9.boost. The responses are maintained following MVA boost. 

This data shows that the gpn fusion protein is immunogenic in primates when it is 
1 0 delivered using recombinant vectors. 

GAG, POL and NEF of gpn are recognised in immunised macaque monkeys. 

Figures 10, 11 and 12 show responses of individual animals to different parts of the 
15 gpn protein indicating that T cell epitopes from the proteins gag, pol and nef are 
recognised in the same responding animal. For example animal N93 in Group 4 shows 
T cell responses recognising the GAG, POL and NEF portion of the GPN protein 
indicating that cell ,s can be induced using GPN. Furthermore, 

without wishing to be bound uy uieory, the breadth o 47 fU ~ ; ~mune response to the 
20 diverse elements of the triple GPN polypeptide indicates an advantageous feature of 
the invention in avoiding immunodominance effects where a single epitope can 
dominate the immune response to a polypeptide at the cost of responses to other 
epitopes present on the polypeptide. By contrast, the polypeptide according to the 
present invention of this example advantageously demonstrates a broad and balanced 
25 response. 

Primate Immunogenicity Single Administration Experiments 

The immunogenicity of the GPN polypeptide (SEQ ID No. 1) was tested in 10 rhesus 
30 macaques (Macaca mulatta), labelled A-J, in single administration ("single shot") 
experiments. In this example the polypeptide was delivered via either the MVA or 
FP9 vector. 
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ELISpot assays were carried out as described above. Responses to 119 overlapping 
~20mer peptides (as described in Example 7) were measured and then summed to give 
a total response. 

Responses were measured prior to injection and 28 days post injection. Responses are 
shown below. GPN elicited an immune response in all animals studied. 





Day 0 
Bleed then 
i.d. 5x10* pfu 
MVA.gpn 


Day 
28 
Bleed 


Summed ELISpot response 
SFC/millionPBMC 


Animal 


A 


13.35 


28.33 


B 


6.70 


203.33 


C 


23.40 


133.33 


D 


6.65 


20.00 


E 


0.00 


320.00 






Day 0 
Bleed then 
i.d. 5x1 0 8 pfu 
FP9.gpn 


Day 28 
Bleed 


Summed ELISpot response 
SFC/millionPBMC 


Animal 


F 


0 


9.90 


G 


28.35 


48.33 


H 


3.35 


31.67 


I 


0 


98.33 


J 


0 


345.00 
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The results indicate that the GPN polypeptide is immunogenic and induces interferon- 
gamma secreting T cells following viral vector immunisation in primates such as 
macaque monkeys. 

5 

Thus the effectiveness of the recombinant genes of the present invention is 
demonstrated in vivo in primates. 

Example 5: Heterologous prime-boost immunisation regimens with FP9.gpn and 
10 MVA.gpn elicit enhanced T cell responses. 

In this example it is demonstrated that FP9.gpn and MVA.gpn elicit enhanced antigen- 
specific CD4+ and CD8+ T cell responses when administered to subjects in 
heterologous or homologous prime-boost immunisation regimens. In this example, the 
1 5 effect is demonstrated in mice. 

Female BALB/c mice (6-8 weeks old) are immunised with either 1 X 10 5 PFU 
FP9.gpn or the 1 X 10 6 of MVA.gpn by iv. injection. Animals were boosted two 
weeks later with either virus in an identical manner to the initial immunisation. 

20 

Fourteen days after the booster immunisation, all mice were sacrificed by cervical 
dislocation and the T cell responses elicited against three CD8+ epitopes (Table A; 
AMQ, TTS, RGP) and a CD4+ epitope (Table A: NPP) from GPN were determined by 
IFN-y ELISpot assay as described in the above examples. 

25 

Results were calculated as the sum of the number of epitope-specific IFNy spot 
forming cells/million splenocytes (sfc/million). Differences between groups were 
determined by one-way ANOVA and a post hoc Tukey-Karamer multiple comparison 
test on logio transformed data using GraphPad Instat version 3.05. 

30 

Results 
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The demonstration that immunisation with FP9.gpn and MVA.gpn in prime-boost 
regimes elicits enhanced T cell responses against GPN in mice is illustrated in figure 8. 
Female BALB/c mice were immunised iv. with FP9.gpo (FP9) or MVA.gpn 
(MVA)sham immunised with PBS ( - ) and boosted 14 days later with either virus as 
5 described above. The CD8+ T cell response was determined in splenocytes 14 days 
after the booster immunisation using the IFN-y ELISpot assay. Columns represent the 
sum of the mean IFN-y spot forming cells/million splenocytes =fc standard deviation for 
4 mice per group elicited by AMQ, TTS, RGP and NPP (see Table A). 

10 Heterologous prime-boost immunisation with FP9.gpn and MTVA.gpn in either order 
elicited a significantly (P<0.001) enhanced overall T cell response against epitopes 
from GPN when compared to subjects given homologous immunisations with either 
virus (Figure 8). 

15 Heterologous immunisation with FP9.gpn/MVA.gpn or MVA.gpn/FP9.gpn elicits 
significantly higher T cell responses against GPN than homologous immunisation with 
FP9.gpn/FP9.gpn or MVA.gpn/MVA.gpn. 

Moreover, these viruses can advantageously be used interchangeably as priming and 
20 boosting agents with each other. 

Thus the efficacy of recombinant HIV genes such as the gpn gene is demonstrated, in 
particular when delivered by viral vector such as fowlpox or M"VA based vectors. 

25 Example 6: Heterologous prime-boost immunisation with FP9.gpn and MVA.gpn 
elicits enhanced CD8+ and CD4+ T cell responses against the individual 
component proteins of gpn 

It is demonstrated that heterologous prime-boost immunisation with FP9.gpn and 
30 MVA.gpn or FP9.gpn and MVA.gpn elicits enhanced antigen-specific CD4+ and 
CD8+ T cell responses against the component parts of the GPN polypeptide. In this 
example, the demonstration is presented in mice. 
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Female BALB/c and C57/BL6 mice (6-8 weeks old) are immunised with either 1 X 
10 6 PFU FP9.gpn or 1 X 10 6 PFU MVA.gpn by iv. injection. Animals are boosted two 
weeks later with either virus in an identical manner to the initial immunisation. 

5 

Fourteen days after the booster immunisation, all mice are sacrificed by cervical 
dislocation and the T cell responses elicited against a library of peptides (20 amino 
acids overlapping by 10 amino acids) derived from the GPN sequence determined 
using the IFN-y ELISpot assay as described in the above example. 

10 

Results were calculated as the sum of the number of epitope-specific IFNy spot 
forming cells/million splenocytes (sfc/million) for each peptide. 

Results 

15 

Heterologous prime-boost immunisation with FP9.gpn and MVA.gpn (administered in 
either order) elicits enhanced T cell responses across the GPN sequence when 
compared to homologous immunisation regimes with the same vectors. 

20 Moreover, these T cell responses include CD8+ and CD4+ responses, as well as being 
directed at epitopes derived from several regions of the GPN polypeptide. 

Thus it is demonstrated that heterologous prime-boost is a highly effective application 
of the recombinant gene(s) of the present invention. 

25 

Example 7: Epitope Mapping of GPN in Mice 

The gene encoding a gag, pol and nef polyprotein (gpn; figure 1; SEQ ID NO: 11) of 
the Human Immunodeficiency Virus (HIV) was inserted into pox virus vectors: 
30 modified vaccinia virus Ankara (MVA.gpn) and attenuated fowlpox virus (FP9.gpn). 
The immune response elicited in BALB/c mice against gpn by these vectors in prime- 
boost immunization regimens was evaluated. These studies showed that T cell 
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responses can be detected in the gag, pol and nef regions of this polypeptide using 
pools of overlapping peptides. One aim of this study was to identify individual 
peptides within responding peptide pools, thereby defining the location of the 
responding epitopes within the gag, pol and nef regions of the polypeptide. 

5 

In this example, immunogenic epitopes within each responding peptide pool are 
identified, the T cell subset responsible for eliciting the identified peptide-specific 
EFN-Y responses is delineated and sequences corresponding to the identified epitopes 
are compared to other studies. 

10 

Methods 

Mice and Immunisations 

15 Female BALB/c mice (H2d; 6-8 weeks) were used in all experiments and kept in 
individually ventilated cages in accordance with the Animals (Scientific Procedure) 
Act 1986 of the U.K. 1 x 106 pfu of recombinant virus was administered intravenously 
(i.v.; 100/x into the tail vein). 

20 Immunological Assays 

ELISPOT assays were performed as described previously. A cut-off value of 3 times 
standard deviation of the negative control (responses >60 SFCs per million) was 
applied to detect positive responses. 

25 

Depletion of CD4+and CD8+ cells 

A single cell suspension.was prepared in PBS containing 1%FCS and incubated for 15 
minutes at 4°C with either anti-CD4 or anti-CD8 MACS beads (Miltenyi Biotech®, 
30 Germany), according to the manufacturer's instructions. On completion of the 
incubation, cells were washed once (PBS, 1%FCS) and loaded onto a MACS column 
which was placed on a MACS magnet. CD4 and CD8 cell content before and after cell 
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subset depletion was determined by two-colour flow cytometry. CD4+-depleted 
splenocytes contained <1% CD4+ cells whereas >6% of CD8+ cells were still present 
in the splenocyte preparation following CD8+ depletion. 

5 Peptides 



10 



A total of 119 overlapping ~20-mer peptides spanning the entire gpn-sequence were 
synthesized by Natural and Medical Sciences Institute (NMI; Reutlingen, Germany) 
(Table 1). Peptides were arranged into 6 peptide pools. 

Table 1: Peptides used in this study 



Peptide 


Amino Acid Sequence 


Peptide 


# 




Length 


1 


MAPIVQNLQGQMVHQAISPR 


20 


2 


GQMVHQAISPRTLNAWVKW 


20 


3 


RTLNAWVKWEEKAFSPEVI 


20 


4 


EEKAFSPEV1PMFSALSEGA 


20 


5 


PMFSALSEGATPQDLNTML 


19 


6 


ATPQDLNTMLNTVGGHQAAM 


20 


7 


NTVGGHQAAMQMLKETI 


17 


8 


AAMQMLKETINEEAAEWDRL 


20 


9 


NEEAAEWDRLHPVHAGPIA 


19 


10 


LHPVHAGPIAPGQMREPR 


18 


11 


IAPGQMREPRGSDIAGTTSTL 


21 


12 


SDIAGTTSTLQEQIGWM 


17 


13 


STLQEQIGWMTNNPPffV 


18 


14 


WMTNNPPIPVGEIYKRWIIL 


20 


15 


GEIYKRWHLGLNKIVRMY 


19 


16 


LGLNKIVRMYSPTSILDIRQ 


20 


17 


SPTSDLDIRQGPKEPFRDYV 


20 



gag pool 1 
20 peptides 
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18 


GPKEPFRDYVDRFYKTLRA 


19 




19 


VDRFYKTLRAEQASQEVKNW 


20 


20 


EQASQEVKNWMTETLLVQNA 


20 










21 


MTETLLVQNANPDCKTILKA 


20 


gag pool 2 = 2t 
peptides 


22 


NPDCKTILKALGPAATLEEM 


20 


23 


LGPAATLEEMMTACQGV 


17 


24 


EEMMTACQGVGGPGHKARVL 


20 


25 


GGPGHKARVLMAARASVL 


18 


26 


VLMAARAS VLS GGELDRWEK 


20 


27 


SGGELDRWEKERLRPGGKKK 


20 


28 


IRLRPGGKKKYKXKHIVWA 


19 


29 


KYKLKHIVWASRELERFAV 


19 


30 


ASRELERFAVNPGLLETSEGCR 


22 


31 


GLLETSEGCRQILGQLQPSL 


20 


32 


RQILGQLQPSLQTGSEELR 


19 


33 


SLQTGSEELRSLYNTVATLY 


20 


34 


SLYNTVATLYCVHQRIEVK 


19 


35 


YCVHQREVKDTKEALEKI 


19 


36 


KDTKEALEKffiEEQNKSKKK 


20 


37 


EEEQNKSKKKAQQAAADTGN 


20 


38 


AQQAAADTGNSSQVSQNY 


18 


39 


GNSSQVSQNYTPDKKHQK 


18 


40 


NYTPDKKHQKEPPFLWMGY 


19 




41 


KEPPFLWMGYELHPDKWTVQ 


20 


pol pool 1 = 23 
peptides 


42 


ELHPDKWTVQPIVLPEKDSW 


20 


43 


PIVLPEKDSWTVNDIQKLV 


19 


44 


WTVNDIQKLVGKLNWASQIY 


20 


45 


GKLNWASQIYAGIKVKQLCK 


20 
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46 


AGDCVKQLCKLLRGTKAL 


18 




47 


CKLLRGTKALTEVIPLTEEA 


20 




48 


TEVIPLTEEAELELAENREI 


20 




49 


ELELAENREILKEPVHGVYY 


20 




50 


LKEPVHGVYYDPSKDLIAEI 


20 




51 


DPSKDLIAEIQKQGQGQWTY 


20 




52 


IQKQGQGQWTYQIYQEPFK 


19 




53 


TYQIYQEPFKNLKTGKYARM 


20 




54 


NLKTGKYARMRGAHTNDVKQ 


20 




55 


RGAHTNDVKQLTEAVQKIA 


19 




56 


KQLTEAVQKIATESIVIWGK 


20 




57 


ATESIVIWGKTPKFKLPIQK 


20 




58 


TPKFKLPIQKETWEAWWTEY 


20 




59 


ETWEAWWTEYWQATWDPEW 


19 




60 


YWQATWIPEWEFVNTPPLVK 


20 




61 


EFVNTPPLVKLWYQLEKEPI 


20 




62 


LWYQLEKEPIVGAETFPI 


18 




63 


PIVGAETFPISPIETVPVKL 


20 












64 


SPBBTVPVKLKPGMDGPKVK 


20 


pol pool 2 = 23 


65 


KPGMDGPKVKQWPLTEEKIK 


20 


peptides 


66 


KQWPLTEEKKALVEICTEM 


20 




67 


KALVEICTEMEKEGKISKI 


19 




68 


MEKEGKISKIGPENPYNTPV 


20 




69 


GPENPYNTPVFAKKKDSTK 


20 




70 


FAKKKDSTKWRKLVDFREL 


20 




71 


WRKLVDFRELNKRTQDFWEV 


20 




72 


NKRTQDFWEVQLG1PHPAGL 


20 




73 


VQLGIPHPAGLKKKKSVTVL 


20 




74 


LKKKKSVTVLDVGDAYFSV 


19 




75 


LDVGDAYFSVPLDKDFRKY 


19 
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76 


VPLDKDFRKYTAFTIPSI 


18 




77 


KYTAFTIPSINNETPGIRYQ 


20 


78 


NNETPGIRYQYNVLPQGWK 


19 


79 


YQYNVLPQGWKGSPAIFQ 


18 


80 


GWKGSPAFQSSMTKILEPF 


20 


81 


S SMTKJLEPFRKQNPDIVrY 


20 


82 


RKQNPDIVIYQYMDDLYV 


18 


83 


IYQYMDDLYVGSDLEIGQHR 


20 


84 


GSDLEIGQHRTKIEELRQHL 


20 


85 


TKJEELRQHLLRWGFTTPDK 


20 


86 


LRWGFTTPDKKHQKEPPFLV 


20 




87 


KHQKEPPFLVWKFDSRLAFH 


20 


nef pool 1 = 16 
peptides 


88 


WKFDSRLAFHHMARELHPEY 


20 


89 


HMARELHPEYYKDCDPEKEV 


20 


90 


YKDCDPEKEVLVWKFDA 


17 


91 


KEVLVWKFDANEGENNSLLH 


20 


92 


NEGENNSLLHPMSLHGM 


17 


93 


LLHPMSLHGMDDPEKEVPEK 


20 


94 


DDPEKEVPEKVEEANEGENG 


20 


95 


VEEANEGENGPG1RYPLTF 


19 


96 


GPG1RYPLTFGWCFKLVPV 


19 


97 


FGWCFKXVPVEPEKVEEWQ 


19 


98 


VEPEKVEEWQNYTPGPGIRY 


20 


99 


NYTPGPGIRYQKRQDILDLW 


20 


100 


YQKRQDELDLWVYHTQGYF 


19 


101 


LWVYHTQGYFPDWQNYTPEGL 


21 


102 


DWQNYTPEGLIYSQKRQDI 


19 










103 


LIYSQKRQDIPMTYKAALDL 


20 


nef pool 2 = 17 
peptides 


104 


PMTYKAALDLSHFLKEKGGL 


20 
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105 


SHFLKEKGGLEGLIYSPQV 


19 




106 


LEGLIYSPQVPLRPMTYKAA 


20 




107 


PLRPMTYKAADCAWLEAQ 


18 




108 


AADCAWLEAQEEEEVGFPVR 


20 




109 


EEEEVGFPVRPQVPLRNTAA 


20 




110 


PQVPLRNTAANNADCAWLA 


19 




111 


ANNADCAWLADGVGAVSRDL 


20 




112 


DGVGAVSRDLEKHGAITSSNTA 


22 




113 


HGAITSSNTAANNRRAEPAA 


20 




114 


ANNRRAEPAADGVGAMGGKW 


20 




115 


DGVGAMGGKWSKRSWGW 


18 




116 


KWSKJRSWGWPTVRERMRRA 


20 




117 


PTVRERMRRAEPARGPGRAF 


20 




118 


EPARGPGRAFVTIYPYDV 


18 




119 


AFVTIYPYDVPDYA 


14 












AMQ 


AMQMLKETI 


9 


Immunodomi 
nant control 


RGP 


RGPGRAFVTI 


10 


Reporter 

epitope 

control 



Results 

5 Identification of immunogenic epitopes within each responding peptide pool 

Individual epitope mapping of responding peptide pools was conducted by WN-y. 

ELISPOT assay was performed using overlapping peptides from each pool (see Figure 
10 13). Figure 13 shows IFN-7 responses elicited against peptide pools and overlapping 



WO 2005/030964 



PCT/GB2004/004038 



63 



peptide fragments covering the entire gpn-sequence (with markers). Groups of female 
BALB/c mice (H2 d ; n=4 mice) were immunised intravenously (i.v.) with 1 x 10 6 
plaque forming units (pfu) of MVA.gpn. Fourteen days later, mice were boosted by 
i.v. administration of 1 x 10 6 pfu of FP9.gpn. Fourteen days after boosting, spleens 
5 were removed, pooled (n=4 spleens) and the number of IFN-y spot forming cells (SFC) 
per million splenocytes was determined by IFN-y ELISPOT. Columns represent the 
number of SFC/million. 



In summary these results indicate that the following individual peptides in responding 
1 0 peptide pools were identified: 



In peptide pool gagl : peptide 7: NTVGGHQAAMQMLKETI 

peptide 8: AAMQMLKETINEEAAEWDRL 
peptide 15: GEr^lWELGLNKIVRMY 

15 

In peptide pool poll : peptide 50: LKEPVHGVYYDPSKDLIAEI 



In peptide pool pol2: peptide 66: KQWPLTEEKIKALVEICTEM 
peptide 85: TKIEELRQHLLRWGFTTPDK 

20 

In peptide pool nef2: peptidel 1 8: EPA RGPGRAFVTI YPYDV 



The underlined sequence in nef2 corresponds to the reporter epitope that was added to 
provide a measure of the induction of DFN-7 responses. This response in the 
25 penultimate peptide, 118, indicates that the entire GPN polyprotein is being expressed. 
However the IFN-y response detected in peptide pool nef2 was not due to epitope 
sequences derived from native nef but due to the inserted reporter epitope. Example 8 
contains very few CD8+ T cell epitopes in nef for BALB/c mice (H-2d) so the lack of 
response in nef is not surprising. 

30 

Determination of the T cell subset responsible for eliciting the identified peptide- 
specific IFN-y responses 



WO 2005/030964 



PCT/GB2004/004038 



64 

Figure 14 shows IFN-7 responses elicited against selected peptides from gpn-sequence. 
Groups of female BALB/c mice (H2 d ; n=4 mice) were immunised intravenously (i.v.) 
with 1 x 10 6 plaque forming units (pfu) of MVA.gpn. Fourteen days later, mice were 
5 boosted by i.v. administration of 1 x 10 6 pfu of FP9.gpn. Fourteen days after boosting, 
spleens were removed, pooled (n=4 spleens) and either left untreated, depleted of 
CD4 + cells or depleted of CD8 + cells as indicated. The number of IFN-y spot forming 
cells (SFC) per million splenocytes was determined by IFN-y ELISPOT. Columns 
represent the number of SFC/million. 

10 

CD4+ cell depletion studies demonstrated a dramatic reduction in IFN-7 secretion 
following ex vivo stimulation with peptides 15 and 66, whereas a moderate reduction 
was observed after stimulation with peptide 118. These findings suggest that CD4" 1 " T 
cells elicit EFN-7 responses against peptides 15, 66 and 118 (see figure 14). 

15 

A partial reduction in the IFN-Y response against peptides 7, 50, 85 and 118 was 
observed following depletion of CD8 + T cells. The partial reduction in EFN-7 
responses observed against these peptides indicates that they are mediated by CD8 4 " T 
cells (see figure 14). 

20 

Comparison of sequences corresponding to the identified epitopes with other 
described epitopes 

To determine whether the peptide sequences identified in this study have been defiaed 
25 previously , they were compared with those in the Los Alamos HIV sequence data 
base (mouse; see example 8 (BALB/c; H2 d )) 

Epitopes within the sequences corresponding to peptides 7,8,15 and 118 have been 
previously identified in BALB/c mice according to Example 8, whereas epitopes 
30 within sequences corresponding to peptides 50, 66 and peptide 85 have not been 
previously described. 
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Epitopes within the sequences corresponding to all seven peptides described in this 
report have been identified as immunogenic in HIV infected humans as shown in 
Table 2. 

5 Table 2: Results from Los Alamos HIV data base search. 



Peptide # 


Mouse (BALB/c; H2 a ) 


Human 


7 


MHC class I K a -restricted 


MHC class I-restricted 


8 


MHC class I K a -restricted 


MHC class I-restricted 


15 


MHC class H-restricted 


MHC class Il-restricted 


50 


this study 


MHC class I-restricted 


66 


this study 


MHC class I-restricted 


85 


this study 


MHC class I-restricted 


118 


MHC class I Dd-restricted 
MHC class II IA d -restricted 


MHC class I-restricted 
MHC class H-restricted 



These studies indicate that heterologous prime-boost immunisation regimens using 
MVA.gpn and FP9.gpn elicit potent CD4 + and CD8 + IFN-7 secreting T cell responses 
against the gag and pol regions of the gpn-molecule in BALB/c mice. In common with 
10 previous studies conducted with BALB/c mice, no immunogenic epitopes were 
identified in the nef molecule. 

All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and system 

15 of the present invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the present invention. Although the present invention has 
been described in connection with specific preferred embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out 

20 the invention which are obvious to those skilled in biochemistry and biotechnology or 
related fields are intended to be within the scope of the following claims. 
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Example 8: Maps of CTL epitope locations plotted by protein 

This example presents preferred CTL epitopes following HTV Molecular Immunology 
2002: Maps of CTL Epitope Locations Plotted by Protein; Theoretical Biology & 
5 Biophysics, Los Alamos National Laboratory, August 7, 2003. Linear CTL epitopes 
less than 22 amino acids long are shown. Also shown are T-Helper epitope maps of 
the preferred T-Helper epitopes. 
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Figure 9: p24 CTL Map aa 51-100 2/2 
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^ A3 supertype 



broad 
H2k 
H-2d 



A3 



Cw4 



Al 


human 


1 


a: 


\ 


a; 


1 







GPENPYNTPVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGL 

^0 70 80 90 100 



WO 2005/030964 



79 



PCT/GB2004/004038 



RT CTL Epitope Map 



RTCTL Map aa 101-150 



Cw4 



huri. 



A2, A*0201 



A2 



A*0201 



B35 



B*3501 



Cw4 



1 1*5 



01 



B35 



B»3501 ( B35 



B+3501 



All 



E24 



hui M n 



All 
A3 

KKKKSVTVLDVGDAYFSVPLDEDFRKYTAFTIPSINNETPGIRYQYNVLP 
llo lio lio lio lio 



B51 



B*5101 



A*O217jB*5l0l 



\2 supertype 



A2 



A2 



WO 2005/030964 



80 



PCT/GB2004/004038 



RT CTL Epitope Map 



RTCTLMapaa 101-150 



Cw4 



hU1 1211 



A2, A*0201 



A2 



A*0201 



B35 



B*3501 



Cw4 



1*5 



01 



B35 



B*3501,B35 



B*3501 



All 



124 



hui 1£11 



All 
A3 

KKKKS VTVLD VGDAYF S VP LDEDFRKY TAFTI P S INNETP G I RYQ YNVLP 
llo lio 130 140 150 



B51 



B*5101 



A*0217,B*5I01 



\2 supertype 



A2 



A2 



WO 2005/030964 



81 



PCT/GB2004/004038 



RT CTL Epitope Map 



RTCTLMap aa 251-300 



B57 



B57 



B57 
B*580ji 
B*5701, B 



B*57pH 

B57 
B*570(l 



5801 



Bw62 



K2 supertypc t 



B62 



B*1501 



A30 



/ *3002 



B42 



B*4201 



A3 



A3 



A*1101 



human 



human 



B*3501, B»530l,B»5101, B*0702 



B51 



B35.B51 



B35 



B*3501 ANDB*5101 



B*3501 



SWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLTEEAE 
2^0 2^0 2A0 290 3&0 



WO 2005/030964 



82 



PCT/GB2004/004038 



RT CTL Epitope Map 



RTCTL Map aa 301-350 
Bw62 



B51 
B35J51 
B3f 

3*3501, B*530U: *5101, B*p' 
B*3501 ANI)B*5101 
B*35)l 



B62 


B*1501 


A2.1 


A2.1 




A2, A2 transgeni 




A2, A*0202 




A2, A*0201 




A2 supertype 




A2 




A2 




A2 




A02 




A*0201,A*0205 




A*0201 




A*0201 




A*0201 




A*0201 




A*0201 




A*02 




human 




A+0201 




702 





All 



A*1101 



All 



All 



human 



LELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLK 
3I0 3io 330 340 3^0 



WO 2005/030964 



83 



PCT/GB2004/004038 



RT CTL Epitope Map 



RTCTL Mapaa 351-400 



B57 



A3 



All 



A*3002 



B70 



A 11 , A*6802 



B*5701 B*5801 



B*5701 



B44 



A 


J2 


A*3 


201 







TGKYARMRGAHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWET 



3^0 



Figure 28: RT CTL Map aa 401-450 
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Figure 29: RT CTL Map aa 451-500 
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Figure 31: RT CTL Map aa 551-560 
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Figure 16: RT T-Helper Map aa 101-150 



H-2t4 



I-Ab andHIA-DR 



H-2t4 



PR supen lotif 



H-2k,i5 



KKKKSVTVLDVGDAYFSVPLDEDFRKYTAFTIPSINNETPGIRYQYNVLP 
llo 120 130. 140 150 



Figure 17: RT T-Helper Map aa 151-200 
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CLAIMS 



1 . A recombinant polypeptide comprising amino acid sequence derived from 
(i) an HIV gag gene product; 
5 (ii) an HTV pol gene product; and 

(iii) an HTV nef gene product, 

said sequence being mutated with respect to the natural sequence of said gene product, 
and said sequence maintaining substantially all of the naturally occurring CD8+ T cell 
10 epitopes of said gene product as defined in pi 7 and p24 (gag), amino acids 1-440 of 
RT (pol) and nef shown in Example 8, said polypeptide comprising amino acid 
sequence having at least 75% identity to SEQ ID NO:9. 



2. A recombinant polypeptide according to claim 1 wherein said polypeptide 
1 5 comprises amino acid sequence having at least 95% identity to SEQ ID NO:9. 

3. A recombinant polypeptide according to claim 1 or claim 2 wherein the 
sequence identity is interepitope sequence identity. 

20 4. A recombinant polypeptide according to claim 3 wherein said polypeptide 
comprises SEQ ID NO:9. 

5. A recombinant polypeptide according to any of claims 1 to 4 wherein 
substantially all of the naturally occurring T helper epitopes of said gene product as 

25 defined in pl7 and p24 (gag), amino acids 1-440 of RT (pol) and nef shown in 
Example 8 are maintained. 

6. A recombinant polypeptide comprising amino acid sequence derived from at 
least two of 

30 (i) an HIV gag gene product; 

(ii) an HIV pol gene product; or 

(iii) an HIV nef gene product, 
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said sequence being mutated with respect to the natural sequence of said gene product, 
and said sequence maintaining substantially all of the naturally occurring CD8+ T cell 
epitopes of the corresponding part(s) of said gene product as defined in pi 7 and p24 
5 (gag), amino acids 1 -440 of RT (pol) and nef shown in Example 8 . 

7. A recombinant polypeptide according to claim 6 comprising amino acid 
sequence derived from (i) and (ii) and (iii). 

10 8. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising amino acid sequence derived from an HIV nef gene product, said 
recombinant polypeptide sequence being mutated to disrupt the function of said nef 
sequence, said nef sequence further comprising one or more T helper epitopes which 
are not present in the naturally occurring nef gene. 

15 

9. A recombinant polypeptide according to claim 8 comprising one or more T 
helper epitopes which are not present in the naturally occurring nef sequence and are 
shown in Figure 3 A. 

20 10. A recombinant polypeptide according to claim 8 or claim 9 comprising one or 
more T helper epitope(s) which are shown in Figure 3A and which are absent from 
Figure 3B. 

11. A recombinant polypeptide according to any of claims 8 to 10 further 
25 comprising all naturally occurring CD8+ T cell epitopes of the nef gene product as 
defined in Example 8. 



30 



12. A recombinant polypeptide according to any of claims 8 to 11 further 
comprising all naturally occurring nef human T helper epitopes as defined in Example 
8. 
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13. A recombinant polypeptide according to any of claims 8 to 12 wherein said 
polypeptide comprises sequence as shown in SEQ ID NO:6, or a sequence having at 
least 95% identity thereto. 

5 14. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising amino acid sequence derived from an HIV pol gene product, said 
recombinant polypeptide sequence being mutated to disrupt the reverse transcriptase 
activity of the pol sequence, wherein substantially all of the CD8+ T cell epitopes of 
the naturally occurring pol sequence as defined in amino acids 1-440 of RT (pol) 
10 shown in Example 8 are retained in said recombinant polypeptide. 

15. A recombinant polypeptide according to claim 14, wherein the reverse 
transcriptase activity of said pol sequence is mutated by duplication of an internal 
sequence derived from the centre of the naturally occurring pol gene and exchange of 

1 5 the amino and carboxy terminal portions of said pol sequence. 

16. A recombinant polypeptide according to claim 15 wherein said duplicated 
internal sequence comprises TPDKKHQKEPPF (SEQ ID NO:4). 

20 17. A recombinant polypeptide according to claim 15 or claim 16 wherein said 
polypeptide comprises sequence as shown in SEQ ID NO: 12 or a sequence having at 
least 95% identity thereto. 

18. A recombinant polypeptide according to any preceding claim, said polypeptide 
25 comprising amino acid sequence derived from an HIV gag gene product, said 
recombinant polypeptide sequence being mutated to disrupt processing of the gag gene 
product, and said gag sequence further comprising a disrupted myristoylation site, 
wherein substantially all of the CD8+ T cell epitopes of the naturally occurring gag 
sequence as defined in pi 7 and p24 (gag) shown in Example 8 are retained in said 
30 recombinant polypeptide. 
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19. A recombinant polypeptide according to claim 18 wherein the processing of 
gag is disrupted by exchanging the pi 7 and p24 domains and wherein the 
myristoylation site is disrupted by mutation of the second glycine to alanine. 

5 20. A recombinant polypeptide according to claim 18 or claim 19 wherein said 
polypeptide comprises sequence as shown in SEQ ID NO: 13 or a sequence having at 
least 95% identity thereto. 

21. A recombinant polypeptide according to any preceding claim further 
10 comprising an antibody recognition tag wherein said tag is an HA tag comprising the 

sequence as shown in SEQ ED NO:8. 

22. A recombinant polypeptide according to any preceding claim further 
comprising a CD8+ T cell epitope tag wherein said tag is a gpl60 derived tag 

1 5 comprising the sequence as shown in SEQ ID NO:7. 

23. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising the sequence as shown in SEQ ID NO: 1 . 

20 24, A recombinant polypeptide according to any preceding claim wherein the HIV 
is a clade B HIV. 

25. A recombinant nucleic acid encoding a polypeptide according to any preceding 
claim. 

25 

26. A recombinant nucleic acid sequence comprising SEQ ID NO: 11, or a 
sequence which differs only by silent mutations with respect to the genetic code, or a 
sequence having at least 95% identity thereto. 

30 27. A viral vector encoding a polypeptide according to any one of claims 1 to 24, 
said viral vector optionally being selected from the group consisting of poxviruses, 
adenoviruses, AAV, alphavirus, VSV, HSV and Sendai virus. 
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28. A viral vector according to claim 27 wherein said vector is an MVA or M.VA 
derived vector. 

5 29. A viral vector according to claim 27 wherein said vector is a fowlpox or 
fowlpox derived vector. 

30. A viral vector according to claim 29 wherein said vector is an FP9 fowlpox 
vector. 

10 

31. A nucleic acid vector comprising a nucleic acid sequence according to claim 25 
or claim 26 or encoding a polypeptide according to any one of claims 1 to 24. 

32. An adenovirus vector comprising a nucleic acid sequence according to claim 
15 25 or claim 26 or encoding a polypeptide according to any one of claims 1 to 24. 

33. A poxvirus vector comprising a nucleic acid sequence according to claim 25 or 
claim 26 or encoding a polypeptide according to any one of claims 1 to 24. 

20 34. A plasmid selected from the group consisting of p29D.gpn, pOPK6.gpn and 
pSG2.gpn. 

35. Use of a polypeptide according to any one of claims 1 to 24 in medicine. 

25 36. Use of polypeptide according to any one of claims 1 to 24 in the preparation, of 
a medicament for the treatment or prevention of HIV infection. 

37. Use of polypeptide according to any one of claims 1 to 24 in the preparation of 
a medicament for immunisation against HIV infection. 

30 

38. Use of a nucleic acid or a vector according to any one of claims 25 to 34 in 
medicine. 
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39. Use of a nucleic acid or a vector according to any one of claims 25 to 34 in the 
preparation of a medicament for the treatment or prevention of HIV infection. 

5 40. Use of a nucleic acid or a vector according to any one of claims 25 to 34 in the 
preparation of a medicament for immunisation against HIV infection. 

41. A method of immunising a subject against HIV infection comprising 
administering to said subject a polypeptide according to any one of claims 1 to 24 or a 

1 0 nucleic acid or vector according to any one of claims 25 to 34. 

42. Use of a polypeptide according to any one of claims 1 to 24 or a nucleic acid or 
a vector according to any of claims 25 to 34 as a priming agent or as a boosting agent 
in a prime-boost immunisation regime. 

15 

43. Use of a polypeptide according to any one of claims 1 to 24 or a nucleic acid or 
a vector according to any of claims 25 to 34 in the induction of an immune response. 

44. Use according to claim 43, wherein the immune response is selected from the 
20 group consisting of a CD8+ T cell response, a CD4+ T cell response, and a humoral 

response. 



45. A method for inducing an immune response in a subject, comprising 
administering to said subject a polypeptide according to any one of claims 1 to 24 or a 

25 nucleic acid or a vector according to any of claims 25 to 34. 

46. A method according to claim 45, wherein the immune response is selected 
from the group consisting of a CD8+ T cell response, a CD4+ T cell response, and a 
humoral response. 

30 

47. A recombinant polypeptide comprising amino acid sequence having at least 
95% identity to the amino acid sequence presented in SEQ ID NO:9. 
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48. A Herpes Simplex Virus (HSV) vector comprising a nucleic acid sequence 
according to claim 25 or claim 26 or encoding a polypeptide according to any one of 
claims 1 to 24. 

49. A recombinant polypeptide, recombinant polynucleotide or viral vector 
substantially as hereinbefore described with reference to the accompanying drawings. 
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AMENDED CLAIMS 

[received by the International Bureau on 28 February 2005 (28.02.05); 
original claims 1-49 replaced by new claims 1-56 (7 pages)] 

1. A recombinant polypeptide comprising amino acid sequence derived from 

5 (i) an HIV gag gene product; 

(ii) an HIV pol gene product; and 

(iii) an HIV nef gene product, 

said sequence being mutated with respect to the natural sequence of said gene 
10 products, and said sequence maintaining substantially all of the naturally occurring 
CD8+ T cell epitopes of said gene products, said polypeptide comprising an amino 
acid sequence having at least 75% identity to SEQ ID NO:9. 

2. A recombinant polypeptide according to claim 1 wherein the naturally 
15 occurring CD8+ T cell epitopes of said gene products are as defined in pi 7 and p24 

(gag), amino acids 1-440 of RT (pol) and nef shown in Example 8. 

3. A recombinant polypeptide according to claim 1 wherein said polypeptide 
comprises an amino acid sequence having at least 95% identity to SEQ ID NO:9. 

20 

4. A recombinant polypeptide according to any preceding ' claim wherein the 
sequence identity is interepitope sequence identity. 

5. A recombinant polypeptide according to claim 4 wherein said polypeptide 
25 comprises SEQ ID NO:9. 

6. A recombinant polypeptide according to any preceding claim wherein 
substantially all of the naturally occurring T helper epitopes of said gene products are 
maintained. 

30 
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7. A recombinant polypeptide according to claim 6 where the naturally occurring 
T helper epitopes of said gene products are as defined in pi 7 and p24 (ga.g), amino 
acids 1-440 of RT (pol) and nef shown in Example 8 

5 8. A recombinant polypeptide comprising amino acid sequence derived from at 
least two of 

(i) an HDTV gag gene product; 

(ii) an HIV pol gene product; or 

(iii) an HIV nef gene product, 

10 

said sequence being mutated with respect to the natural sequence of said gene 
products, and said sequence maintaining substantially all of the naturally occurring 
CD8+ T cell epitopes of the corresponding part(s) of said gene products. 

15 9. A recombinant polypeptide according to claim 8 wherein the naturally 
occurring CD8+ T cell epitopes are as defined in pi 7 and p24 (gag), amino acids 1- 
440 of RT (pol) and nef shown in Example 8. 

10. A recombinant polypeptide according to claim 8 comprising amino acid 
20 sequence derived from (i) and (ii) and (iii). 

11. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising amino acid sequence derived from an HTV nef gene product, said 
recombinant polypeptide sequence being mutated to disrupt the function of said nef 

25 sequence, said nef sequence further comprising one or more T helper epitopes which 
are not present in the naturally occurring nef gene. 

12. A recombinant polypeptide according to claim 11 wherein the one or more T 
helper epitopes which are not present in the naturally occurring nef sequence are 

30 shown in Figure 3A. 
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13. A recombinant polypeptide according to claim 1 1 or claim 12 wherein the one 
or more T helper epitope(s) which are not present in the naturally occurring nef 
sequence are shown in Figure 3A and are absent from Figure 3B. 

5 14. A recombinant polypeptide according to any of claims 11 to 13 further 
comprising all naturally occurring CD8+ T cell epitopes of the nef gene product. 

15. A recombinant polypeptide according to claim 14 wherein the naturally 
occurring CD8+ T cell epitopes of the nef gene product are as defined in Example 8 

10 

16. A recombinant polypeptide according to any of claims 11 to 15 further 
comprising all naturally occurring nef human T helper epitopes. 

17. A recombinant polypeptide according to claim 16 wherein the naturally 
15 occurring nef human T helper epitopes are as defined in Example 8. 

18. A recombinant polypeptide according to any of claims 11 to 17 wherein said 
polypeptide comprises an amino acid sequence as shown in SEQ ID NO:6, or a 
sequence having at least 95% identity thereto. 

20 

19. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising amino acid sequence derived from an HIV pol gene product, said 
recombinant polypeptide sequence being mutated to disrupt the reverse transcriptase 
activity of the pol sequence, wherein substantially all of the CD8+ T cell epitopes of 

25 the naturally occurring pol sequence are retained in said recombinant polypeptide. 

20. A recombinant polypeptide according to claim 19 wherein the CD8+ T cell 
epitopes of the naturally occurring pol sequence are as defined in amino acids 1-440 of 
RT (pol) shown in Example 8. 

30 

21. A recombinant polypeptide according to claim 19 or 20 , wherein the reverse 
transcriptase activity of said pol sequence is mutated by duplication of an internal 
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sequence derived from the centre of the naturally occurring pol gene and exchange of 
the amino and carboxy terminal portions of said pol sequence. 

22. A recombinant polypeptide according to claim 21 wherein said duplicated 
5 internal sequence comprises TPDKKHQKEPPF (SEQ ED NO:4). 

23. A recombinant polypeptide according to claim 21 or claim 22 wherein said 
polypeptide comprises an amino acid sequence as shown in SEQ ED NO: 12 or a 
sequence having at least 95% identity thereto. 

10 

24. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising amino acid sequence derived from an HIV gag gene product, said 
recombinant polypeptide sequence being mutated to disrupt processing of the gag gene 
product, and said gag sequence further comprising a disrupted myristoylation site, 

15 wherein substantially all of the CD8+ T cell epitopes of the naturally occurring gag 
sequence are retained in said recombinant polypeptide. 

25. A recombinant polypeptide according to claim 24 wherein the CD8+ T cell 
epitopes of the naturally occurring gag sequence are as defined in pl7 and p24 (gag) 

20 shown in Example 8. 

26. A recombinant polypeptide according to claim 24 or 25 wherein the processing 
of gag is disrupted by exchanging the pl7 and p24 domains and wherein the 
myristoylation site is disrupted by mutation of the second glycine to alanine. 

25 

27. A recombinant polypeptide according to any one of claims 24-26 wherein said 
polypeptide comprises an amino acid sequence as shown in SEQ ID NO: 13 or a 
sequence having at least 95% identity thereto. 

30 28. A recombinant polypeptide according to any preceding claim further 
comprising an antibody recognition tag wherein said tag is an HA tag comprising the 
sequence as shown in SEQ ED NO: 8. 
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29. A recombinant polypeptide according to any preceding claim further 
comprising a CD8+ T cell epitope tag wherein said tag is a gpl60 derived tag 
comprising the sequence as shown in SEQ ID NO: 7. 

5 

30. A recombinant polypeptide according to any preceding claim, said polypeptide 
comprising the sequence as shown in SEQ ID NO: 1. 

31. A recombinant polypeptide according to any preceding claim wherein the HTV 
10 is a clade B HIV. 

32. A recombinant nucleic acid encoding a polypeptide according to any preceding 
claim. 

15 33. A recombinant nucleic acid sequence comprising SEQ ID NO:ll, or a 
sequence which differs only by silent mutations with respect to the genetic code, or a 
sequence having at least 95% identity thereto. 

34. A viral vector encoding a polypeptide according to any one of claims 1 to 31, 
20 said viral vector optionally being selected from the group consisting of poxviruses, 

adenoviruses, AAV, alphavirus, VSV, HSV and Sendai virus. 

35, A viral vector according to claim 33 wherein said vector is an MVA or MVA 
derived vector. 

25 

. 36. A viral vector according to claim 34 wherein said vector is a fowlpox or 
fowlpox derived vector. 

37. A viral vector according to claim 36 wherein said vector is an FP9 fowlpox 
30 vector. 
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38. A nucleic acid vector comprising a nucleic acid sequence according to claim 32 
or claim 33 or encoding a polypeptide according to any one of claims 1 to 31. 

39. An adenovirus vector comprising a nucleic acid sequence according to claim 
5 32 or claim 33 or encoding a polypeptide according to any one of claims 1 to 31. 

40. A poxvirus vector comprising a nucleic acid sequence according to claim 32 or 
claim 33 or encoding a polypeptide according to any one of claims 1 to 31. 

10 41. A plasmid selected from the group consisting of p29D.gpn, pOPK6.gpn and 
pSG2.gpn. 

42. Use of a polypeptide according to any one of claims 1 to 31 in medicine. 

15 43. Use of polypeptide according to any one of claims 1 to 31 in the preparation of 
a medicament for the treatment or prevention of HIV infection. 

44. Use of polypeptide according to any one of claims 1 to 31 in the preparation of 
a medicament for immunisation against HIV infection. 

20 

45. Use of a nucleic acid or a vector according to any one of claims 32 to 41 in 
medicine. 

46. Use of a nucleic acid or a vector according to any one of claims 32 to 41 in the 
25 preparation of a medicament for the treatment or prevention of HIV infection. 

47. Use of a nucleic acid or a vector according to any one of claims 32 to 41 in the 
preparation of a medicament for immunisation against HTV infection. 

30 48. A method of immunising a subject against HIV infection comprising 
administering to said subject a polypeptide according to any one of claims 1 to 31 or a 
nucleic acid or vector according to any one of claims 32 to 41. 
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49. Use of a polypeptide according to any one of claims 1 to 31 or a nucleic acid or 
a vector according to any of claims 32 to 41 as a priming agent or as a boosting agent 
in a prime-boost immunisation regime. 

5 

50. Use of a polypeptide according to any one of claims 1 to 31 or a nucleic acid or 
a vector according to any of claims 32 to 41 in the induction of an immune response. 

51. Use according to claim 50, wherein the immune response is selected from the 
10 group consisting of a CD8H- T cell response, a CD4+ T cell response, and a humoral 

response. 

52. A method for inducing an immune response in a subject, comprising 
administering to said subject a polypeptide according to afty one of claims 1 to 31 or a 

15 nucleic acid or a vector according to any of claims 32 to 41. 

53. A method according to claim 52, wherein the immune response is selected 
from the group consisting of a CD8+ T cell response, a CD4+ T cell response, and a 
humoral response. 

20 

54. A recombinant polypeptide comprising an amino acid sequence having at least 
95% identity to the amino acid sequence presented in SEQ ID NO:9. 

55. A Herpes Simplex Virus (HSV) vector comprising a nucleic acid sequence 
25 according to claim 32 or claim 33 or encoding a polypeptide according to any one of 

claims 1 to 31. 

56. A recombinant polypeptide, recombinant polynucleotide or viral vector 
substantially as hereinbefore described with reference to the accompanying drawings. 
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STATEMENT 

Statement Under Article 19(1) 
PCT Patent Application No. PCT/GB2004/004038 

The amended claim set finds basis in the application as originally filed, in particular in the 
claims of the application as originally filed. The amended claims do not add matter to the 
application. 

Original claims 1, 5, 6,11,12 and 18 have each been divided into amended claims 1 and 2, 6 
and 7, 8 and 9, 14 and 15, 16 and 17, 24 and 25 in order to improve clarity. 

Original claims 2, 13, 17, 20 and 47 have been amended into new claims 3, 18, 23, 27 and 54 
by adding "an" or "an amino acid". 

Original claims 9 and 10 have been amended into new claims 12 and 13 in order to improve 
the references to figures 3a and 3b. 

The remaining claims have been amended purely to maintain the correct dependency. 
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FIG. 2 

Scrambled NEF from gpn 

LVWKFDSRLAFHHMARELHPEYYKDCDPEKEVLWKFDANEGENNSLLHPMSLHGMDDPEKEV 
PEKVEEANEGENGPGIRYPLTFGWCEKLVPVEPEKVEEWQNYTPGPGIRYQKRQDILDLWVYH 
TQGYFPDWQNYTPEGLI^SQKRQDIPMTYKAALDLSHFLKEKGGLEGLIYSPQVPLRPMTYKA 
ADCAWLEAQEEEEVGFPVRPQVPLRNTAANNADCAWLADGVGAVSRDLEKHGAITSSNTAANN 
RRAEPAADGVGAMGGKWSKRSWGWPTVRERMRRAEPA 



Native Nef 



MGGKWSKRS VV 

VSRDLEKHG AI 

EVGFPVRPQ VP 

LEGLIYSQK RQ 

PGPGIRYPL TF 

NNSLLHPMS LH 

HMARELHPE YY 



GWPTVRERMR 
TSSNTAANNA 
LRPMTYKAAL 
DILDLWVYHT 
GWCFKLVPVE 
GMDDPEKEVL 
K D C 



RAEPAADGVGA 
DCAWLEAQEEE 
DLSHFLKEKGG 
QGYFPDWQNYT 
PEKVEEANEGE 
VWKFDSRLAFH 
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FIG. 6 

gpn base sequence 

CCGGGCCCGCCGCCACCATGGCCCCCATCGTGCAGAACCTGCAAGGCCAGATGGTGCACCAGGC 
CATCAGCCCCAGAACCCTGAATGCCTGGGTGAAGGTGGTGGAGGAGAAGGCCTTCAGCCCTGAG 
GTGATCCCTATGTTCAGCGCCCTGAGCGAGGGCGCCACCCCCCAGGACCTGAACACCATGCTGA 
ATACCGTGGGCGGCCATCAGGCCGCCATGCAGATGCTGAAGGAGACCATCAACGAGGAGGCCGC 
CGAGTGGGACAGACTGCACCCCGTGCACGCCGGACCTATCGCCCCTGGCCAGATG^GAGAGCCC 
AGAGGCAGCGACATCGCCGGCACCACCAGCACCCTGCAAGAGCAGATCGGCTGGATGACCAACA 
ACCCCCCTATCCCTGTGGGCGAGATCTACAAGAGATGGATCATCCTGGGCCTGAACAAGATCGT 
GAGAATGTACAGCCCTACCAGCATCCTGGACATCAGACAGGGACCCAAGGAGCCCTTCAGAGAC 
TACGTGGACAGGTTCTACAAGACCCTGAGAGCCGAGCAGGCCAGCCAGGAAGTGA^GAACTGGA 
TGACCGAGACCCTGCTGGTGCAGAACGCCAACCCCGACTGCAAGACCATCCTGAAGGCCCTGGG 
ACCTGCCGCTACCCTGGAGGAGATGATGACCGCCTGCCAGGGCGTGGGCGGACCTGGCCACAAG 
GCCAGAGTGCTCATGGCCGCCAGAGCCAGCGTGCTGAGCGGCGGAGAGCTGGATAGATGGGAGA 
AGATCAGACTGAGACCTGGCGGCAAGAAGAAGTACAAGCTGAAGCACATCGTGTGGGCCTCTAG 
GGAGCTGGAGAGATTCGCCGTGAATCCTGGCCTGCTGGAGACCAGCGAGGGCTGTJAGACAGATC 
CTGGGCCAGCTGCAACCCAGCCTGCAAACCGGCTCTGAGGAGCTGAGATCCCTGTACAACACCG 
TGGCCACCCTGTACTGCGTGCACCAGAGAATCGAGGTGAAGGACACAAAGGAGGCCCTGGAGAA 
GATCGAGGAGGAGCAGAACAAGTCCAAGAAGAAGGCCCAGCAGGCCGCTGCCGAC^CCGGCAAC 
AGCAGCCAGGTGAGCCAGAACTACACCCCCGACAAGAAGCACCAGAAGGAGCCCCCTTTCCTGT 
GGATGGGCTACGAGCTGCACCCCGATAAGTGGACCGTGCAGCCCATCGTGCTGCCTGAGAAGGA 
TAGCTGGACCGTGAACGACATCCAGAAGCTGGTGGGCAAGCTGAACTGGGCCAGCCAGATCTAC 
GCCGGCATCAAGGTGAAGCAGCTGTGTAAGCTGCTGAGAGGCACCAAGGCCCTCACAGAAGTGA 
TCCCCCTCACAGAGGAGGCCGAGCTGGAGCTGGCCGAGAACAGAGAGATCCTGAAAGAGCCCGT 
GCACGGCGTGTACTACGACCCCAGCAAGGATCTCATCGCCGAGATTCAGAAGCAGGGCCAGGGC 
CAGTGGACCTACCAGATCTACCAGGAGCCTTTCAAGAACCTGAAAACCGGCAAGT2\CGCCAGAA 
TGAGAGGCGCTCACACCAACGACGTGAAGCAGTTGACCGAGGCCGTGCAGAAAATCGCCACCGA 
GAGCATCGTGATCTGGGGCAAGACCCCCAAGTTCAAGCTGCCTATCCAGAAGGAGACCTGGGAG 
GCCTGGTGGACCGAGTATTGGCAGGCCACCTGGATTCCTGAGTGGGAGTTCGTGAACACCCCCC 
CTCTGGTGAAGCTGTGGTATCAGCTGGAGAAGGAGCCTATCGTGGGCGCCGAGACCTTCCCCAT 
CAGCCCTATCGAGACCGTGCCCGTGAAGCTGAAGCCCGGCATGGACGGCCCCAAGGTGAAACAG 
TGGCCTCTCACCGAGGAGAAGATTAAGGCCCTGGTGGAGATTTGCACCGAGATGGAGAAGGAAG 
GCAAGATCAGCAAGATCGGCCCCGAGAATCCCTACAATACCCCCGTGTTCGCCATTAAGAAGAA 
GGACTCCACCAAGTGGAGAAAGCTGGTGGACTTCAGAGAGCTGAACAAGAGAACCCAGGACTTC 
TGGGAGGTGCAGCTGGGCATCCCCCACCCTGCCGGCCTGAAGAAGAAGAAGAGCGTGACCGTGC 
TGGACGTGGGCGACGCCTACTTCAGCGTGCCCCTGGACAAGGACTTCAGAAAGTACACCGCCTT 
CACCATCCCCAGCATCAACAACGAGACCCCCGGCATCAGATACCAGTACAACGrGCTGCCCCAG 
GGCTGGAAGGGCAGCCCCGCCATCTTCCAGAGCAGCATGACAAAGATCCTGGAGCCCTTCCGGA 
AGCAGAACCCCGATATCGTGATCTACCAGTACATGGACGACCTGTACGTGGGCAGCGATCTGGA 
GATCGGCCAGCACAGAACCAAGATCGAAGAGCTGAGACAGCACCTGCTGAGATGGGGCTTCACC 
ACACCAGATAAGAAACATCAGAAAGAACCACCATTCCTGGTGTGGAAGTTTGACAGCAGACTGG 
CCTTCCACCATATGGCCAGAGAGCTGCATCCTGAGTACTACAAGGACTGCGATCCCGAGAAGGA 
GGTGCTGGTCTGGAAATTCGACGCCAACGAGGGCGAGAACAACAGCCTGCTGCACCCCATGAGC 
CTGCACGGCATGGATGATCCTGAGAAAGAAGTGCCCGAGAAGGTGGAAGAGGCCAATGAGGGAG 
AAAACGGCCCTGGCATTAGATATCCCCTCACCTTCGGCTGGTGCTTCAAGCTGGTGCCTGTGGA 
GCCTGAGAAAGTGGAGGAATGGCAGAATTACACACCCGGCCCAGGCATCCGGTATCAGAAGAGA 
CAGGACATTCTGGACCTGTGGGTGTACCACACCCAGGGCTACTTCCCCGACTGGCAGAACTATA 
CTCCTGAGGGCCTCATCTACAGCCAGAAGCGGCAGGATATCCCCATGACCTACAAGGCCGCCCT 
GGATCTGAGCCACTTTCTGAAAGAGAAGGGCGGCCTGGAGGGCTTGATCTACTCCCCACAGGTG 
CCACTGAGACCTATGACATATAAGGCTGCCGACTGCGCCTGGCTGGAGGCCCAGGAAGAGGAGG 
AAGTGGGCTTCCCTGTGAGACCCCAGGTGCCCCTGCGGAACACCGCCGCCAATAATGCCGATTG 
TGCTTGGCTCGCCGACGGCGTGGGAGCCGTGAGCAGaGACCTGGAAAAGCACGGCGCCATCACC 
AGCAGCAATACAGCCGCCAACAACAGAAGAGCCGAGCCTGCCGCCGATGGAGTGGGCGCCATGG 
GCGGC?JKGTGGAGCAAGAGATCCGTGGTGGGCTGGCCCACCGTGAGAGAAAGAAT<3AGAAGGGC 
CGAACCCGCCAGAGGACCCGGCAGAGCCTTCGTGACCATCTACCCCTACGACGTGdCCGACTAC 
GCTTGATGAGGCGCGCCCT 
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Sequence Listing 

SEQ ID NO: 1 Artificial sequence - GPN 

5 MAP IVQNLQGQMVHQAI SPRTLNAWVKVVEEKAF S 
PEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQM 
LKETINEEAAEWDRLHPVHAGPIAPGQMREPRGSD 

IAGTTS TLQEQIGWMTNNPP I PVGE I YKRWI ILGL 
NKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLR 

10 AEQASQEVKNWM- TETLLVQNANPDCKTILKALGPA 
ATLEEMMTACQGVGGPGHKARVLMAARASVLSGGE 
LDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVN 
PGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV 
ATLYCVHQRIEVKDTKEALEKIEEEQNKSKKKAQQ 

15 AAADTGNSSQVSQNYTPDKKHQKEPPFLWMGYELH 
PDKWTVQPIVLPEKDSWTVNDIQKLVGKLNWASQI 
YAGI KVKQLCKLLRGTKALTEV I PLTEEAELELAE 
NREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQ 
IYQEPFKNLKTGKYARMRGAHTNDVKQLTEAVQKI 

20 ATESIVIWGKTPKFKLPIQKETWEAWWTEYWQATW 
I PEWE FVNTPPLVKLWYQLEKE P I VGAETFP I S P I 
ETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTE 
MEKEGKI SKIGPENPYNTPVFAI KKKDSTKWRKLV 
D FRELNKRTQDFWEVQLGI PHPAGLKKKKSVTVLD 

25 VGDAYFSVPLDKDFRKYTAFTIPSINNETPGIRYQ 
YNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVI 
YQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGFT 
TPDKKHQKEPPFLVWKFDSRLAFHHMARELHPEYY 
KDCDPEKEVLVWKFDANEGENNSLLHPMSLHGMDD 

30 PEKEVPEKVEEANEGENGPGIRYPLTFGWCFKLVP 
VEPBKVEEWQNYTPGPGIRYQKRQDILDLWVYHTQ 
GYFPDWQNYTPEGLIYSQKRQD I PMTYKAALDLSH 
FLKEKGGLEGLIYSPQVPLRPMTYKAADCAWLEAQ 
EEEEVGFPVRPQVPLRNTAANNADCAWLADGVGAV 

35 SRDLE KHGAI TSSNTAANNRRAEPAADGVGAMGGK 
WSKRSVVGWPTVRERMRRAEPARGPGRAFVTIYPY 
D V P D Y A 
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SBQ ID NO:2 Artificial sequence - GAG P24 

MAPIVQNLQGQMVHQAISPRTLNAWVKVVEEKAFS 
PEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQM 
5 LKETINEEAAEWDRLHPVHAGPIAPGQMREPRGSD 
IAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGL 
NKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLR 
AEQASQEVKNWMTETLLVQNANPDCKT I LKALGPA 
ATLEEMMTACQGVGGPGHKARVL 

10 

SEQ ID N0:3 Artificial sequence - GAG PI 7 

MAARASVLSGGELDRWEKIRLRPGGKKKYKLKHIV 
WASRELERFAVNPGLLETSEGCRQILGQLQPSLQT 
15 GSEELRS LYNTVATLYCVHQRI EVKDTKEALEKI E 
EEQNKS KKKAQQAAADTGNS SQVSQNY 

SEQ ID N0:4 Artificial sequence - POL P51 duplication 

20 TPDKKHQKEPPF 

SEQ ID NO:5 Artificial sequence - POL P5 1 core 
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SEQ ID NO:6 Artificial sequence - scrambled NEF 
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SEQ ID NO:7 Artificial sequence - 


- gpl60 murine CD8 tag 
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SEQ ID N0:8 Artificial sequence - 


-HA-tag 
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SEQ ID NO: 9 Artificial sequence 


- core GPN 
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SEQ ID NO: 1 1 Artificial sequence - gpn nucleotide sequence 





c 


C 


G 


G 


G 


C 


C 


c 


G 


C 


C 


G 


C 


C 


A 


C 


C 


A 


T 


G 


G 


C 


C 


C 


C 


C 


A 


T 


C 


G 


T 


G 


C 


A 


G 




A 


A 


C 


C 


T 


G 


C 


A 


A 


G 


G 


C 


C 


A 


G 


A 


T 


G 


G 


T 


G 


C 


A 


c 


C 


A 


G 


G 


C 


C 


A 


T 


c 


A 


G 


5 


C 


C 


c 


C 


A 


G 


A 


A 


C 


C 


C 


T 


G 


A 


A 


T 


G 


C 


C 


T 


G 


G 


G 


T 


G 


A 


A 


G 


G 


T 


G 


G 


T 


G 


G 




A 


G 


G 


A 


G 


A 


A 


G 


G 


C 


C 


T 


T 


C 


A 


G 


C 


C 


C 


T 


G 


A 


G 


G 


T 


G 


A 


T 


C 


C 


C 


T 


A 


T 


G 




T 


T 


C 


A 


G 


C 


G 


C 


C 


C 


T 


G 


A 


G 


C 


G 


A 


G 


G 


G 


C 


G 


C 


C 


A 


C 


C 


C 


C 


C 


C 


A 


G 


G 


A 




C 


C 


T 


G 


A 


A 


C 


A 


C 


C 


A 


T 


G 


C 


T 


G 


A 


A 


T 


A 


C 


C 


G 


T 


G 


G 


G 


C 


G 


G 


C 


C 


A 


T 


C 




A 


G 


G 


C 


C 


G 


C 


C 


A 


T 


G 


C 


A 


G 


A 


T 


G 


C 


T 


G 


A 


A 


G 


G 


A 


G 


A 


C 


C 


A 


T 


C 


A 


A 


C 


10 


G 


A 


G 


G 


A 


G 


G 


C 


C 


G 


C 


C 


G 


A 


G 


T 


G 


G 


G 


A 


C 


A 


G 


A 


C 


T 


G 


C 


A 


C 


C 


C 


C 


G 


T 




G 


C 


A 


C 


G 


C 


C 


G 


G 


A 


C 


c 


T 


A 


T 


C 


G 


C 


C 


C 


C 


T 


G 


G 


C 


C 


A 


G 


A 


T 


G 


A 


G 


A 


G 




A 


G 


C 


C 


C 


A 


G 


A 


G 


G 


C 


A 


G 


C 


G 


A 


C 


A 


T 


C 


G 


C 


C 


G 


G 


C 


A 


C 


C 


A 


C 


C 


A 


G 


C 




A 


C 


C 


C 


T 


G 


C 


A 


A 


G 


A 


G 


C 


A 


G 


A 


T 


C 


G 


G 


C 


T 


G 


G 


A 


T 


G 


A 


C 


C 


A 


A 


C 


A 


A 




C 


C 


C 


C 


C 


C 


T 


A 


T 


C 


C 


C 


T 


G 


T 


G 


G 


G 


C 


G 


A 


G 


A 


T 


C 


T 


A 


C 


A 


A 


G 


A 


G 


A 


T 


15 


G 


G 


A 


T 


C 


A 


T 


C 


C 


T 


G 


G 


G 


C 


C 


T 


G 


A 


A 


C 


A 


A 


G 


A 


T 


C 


G 


T 


G 


A 


G 


A 


A 


T 


G 




T 


A 


C 


A 


G 


C 


C 


C 


T 


A 


C 


C 


A 


G 


C 


A 


T 


C 


C 


T 


G 


G 


A 


C 


A 


T 


C 


A 


G 


A 


C 


A 


G 


G 


G 




A 


C 


C 


C 


A 


A 


G 


G 


A 


G 


C 


C 


C 


T 


T 


C 


A 


G 


A 


G 


A 


C 


T 


A 


C 


G 


T 


G 


G 


A 


C 


A 


G 


G 


T 




T 


C 


T 


A 


C 


A 


A 


G 


A 


C 


C 


c 


T 


G 


A 


G 


A 


G 


C 


C 


G 


A 


G 


C 


A 


G 


G 


C 


C 


A 


G 


C 


C 


A 


G 




G 


A 


A 


G 


T 


G 


A 


A 


G 


A 


A 


c 


T 


G 


G 


A 


T 


G 


A 


c 


C 


G 


A 


G 


A 


C 


C 


C 


T 


G 


C 


T 


G 


G 


T 


20 


G 


C 


A 


G 


A 


A 


C 


G 


C 


C 


A 


A 


C 


C 


C 


C 


G 


A 


C 


T 


G 


C 


A 


A 


G 


A 


C 


C 


A 


T 


C 


C 


T 


G 


A 




A 


G 


G 


C 


C 


C 


T 


G 


G 


G 


A 


c 


C 


T 


G 


C 


C 


G 


C 


T 


A 


C 


C 


C 


T 


G 


G 


A 


G 


G 


A 


G 


A 


T 


G 




A 


T 


G 


A 


C 


c 


G 


C 


C 


T 


G 


c 


C 


A 


G 


G 


G 


C 


G 


T 


G 


G 


G 


C 


G 


G 


A 


C 


C 


T 


G 


G 


C 


C 


A 




C 


A A G 


G 


c 


C 


A 


G 


A 


G 


T 


G 


C 


T 


C 


A 


T 


G 


G 


C 


C 


G 


C 


C 


A 


G 


A 


G 


C 


C 


A 


G 


C 


G 




T 


G 


C 


T 


G 


A 


G 


C 


G 


G 


C 


G 


G 


A 


G 


A 


G 


C 


T 


G 


G 


A 


T 


A 


G 


A 


T 


G 


G 


G 


A 


G 


A 


A 


G 


25 


A 


T 


C 


A 


G 


A 


C 


T 


G 


A 


G 


A 


C 


C 


T 


G 


G 


C 


G 


G 


C 


A 


A 


G 


A 


A 


G 


A 


A 


G 


T 


A 


C 


A 


A 




G 


C 


T 


G 


A 


A 


G 


C 


A 


C 


A 


T 


c 


G 


T 


G 


T 


G 


G 


G 


c 


C 


T 


C 


T 


A 


G 


G 


G 


A 


G 


C 


T 


G 


G 




A 


G 


A 


G 


A 


T 


T 


C 


G 


C 


C 


G 


T 


G 


A 


A 


T 


C 


C 


T 


G 


G 


C 


C 


T 


G 


C 


T 


G 


G 


A 


G 


A 


C 


C 




A 


G 


C 


G 


A 


G 


G 


G 


C 


T 


G 


T 


A 


G 


A 


C 


A 


G 


A 


T 


C 


C 


T 


G 


G 


G 


C 


C 


A 


G 


C 


T 


G 


c 


A 




^ 


C 


C 


C 


A 


G 


c 


c 


T 


G 


c 


A 


A 


A 


c 


c 


G 


G 


C 


T 


c 


T 


G 


A 


G 


G 


A 


G 


C 


T 


G 


A 


G 


A 


T 


30 


C 


C 


C 


T 


G 


T 


A 


c 


A 


A 


C 


A 


C 


C 


G 


T 


G 


G 


c 


C 


A 


C 


C 


C 


T 


G 


T 


A 


C 


T 


G 


C 


G 


T 


G 




C 


A 


C 


C 


A 


G 


A 


G 


A A 


T 


C 


G 


A 


G 


G 


T 


G 


A A 


G 


G 


A 


c 


A 


C 


A 


A 


A 


G 


G 


A 


G 


G 


C 




c 


C 


T 


G 


G 


A 


G 


A 


A 


G 


A 


T 


C 


G 


A 


G 


G 


A 


G 


G 


A 


G 


C 


A 


G 


A 


A 


C 


A 


A 


G 


T 


C 


C 


A 




A 


G 


A 


A 


G 


A 


A 


G 


G 


C 


C 


C 


A 


G 


C 


A 


G 


G 


C 


C 


G 


C 


T 


G 


C 


C 


G 


A 


C 


A 


C 


C 


G 


G 


C 




A 


A 


C 


A 


G 


C 


A 


G 


C 


C 


A 


G 


G 


T 


G 


A 


G 


C 


C 


A 


G 


A 


A 


C 


T 


A 


C 


A 


C 


C 


c 


C 


C 


G 


A 


35 


C 


A 


A 


G 


A 


A 


G 


C 


A 


C 


C 


A 


G 


A 


A 


G 


G 


A 


G 


C 


C 


C 


C 


C 


T 


T 


T 


C 


C 


T 


G 


T 


G 


G 


A 




T 


G 


G 


G 


C 


T 


A 


C 


G 


A 


G 


C 


T 


G 


C 


A 


C 


C 


C 


C 


G 


A 


T 


A 


A 


G 


T 


G 


G 


A 


C 


C 


G 


T 


G 




C 


A 


G 


C 


C 


C 


A 


T 


C 


G 


T 


G 


C 


T 


G 


C 


C 


T 


G 


A 


G 


A 


A 


G 


G 


A 


T 


A 


G 


C 


T 


G 


G 


A 


C 




C 


G 


T 


G 


A 


A 


C 


G 


A 


C 


A 


T 


C 


C 


A 


G 


A 


A 


G 


C 


T 


G 


G 


T 


G 


G 


G 


C 


A 


A 


G 


C 


T 


G 


A 



WO 2005/030964 



6/8 



PCT/GB2004/004038 



ACTGGGCCAG 
AAGCAGCTGT 
CCTCACAGAA 
AGCTGGAGCT 
5 GAGCCCGTGC 
GGATCTCATC 
GCCAGTGGAC 
AAGAACCTGA 
AGGCGCTCAC 

10 AGGCCGTGCA 
ATCTGGGGCA 
CCAGAAGGAG 
ATTGGCAGGC 
GTGAACACCC 

15 GCTGGAGAAG 
TCCCCATCAG 
CTGAAGCCCG 
GTGGCCTCTC 
TGGAGATTTG 

20 ATCAGCAAGA 
CCCCGTGTTC 
AGTGGAGAAA 
AAGAGAACCC 
CATCCCCCAC 

25 GCGTGACCGT 
AGCGTGCCCC 
CGCCTTCACC 
CCGGCATCAG 
GGCTGGAAGG 

30 CATGACAAAG 
ACCCCGATAT 
CTGTACGTGG 
CAGAACCAAG 
TGAGATGGGG 

35 CAGAAAGAAC 
CAGCAGACTG 
TGCATCCTGA 
AAGGAGGTGC 
GGGCGAGAAC 



CCAGATCTACGC 

GTAAGCTGCTGA 

GTGATCCCCCTC 

GGCCGAGAACAG 

ACGGCGTGTACT 

GCCGAGATTCAG 

CTACCAGATCTA 

AAACCGGCAAGT 

ACCAACGACGTG 

GAAAATCGCCAC 

AGACCCCCAAGT 

ACCTGGGAGGCC 

CACCTGGATTCC 

CCCCTCTGGTGA 

GAGCCTATCGTG 

CCCTATCGAGAC 

GCATGGACGGCC 

ACCGAGGAGAAG 

CACCGAGATGGA 

TCGGCCCCGAGA 

GCCATTAAGAAG 

GCTGGTGGACTT 

AGGACTTCTGGG 

CCTGCCGGCCTG 

GCTGGACGTGGG 

TGGACAAGGACT 

ATCCCCAGCATC 

ATACCAGTACAA 

GCAGCCCCGCCA 

ATCCTGGAGCCC 

CGTGATCTACCA 

GCAGCGATCTGG 

ATCGAAGAGCTG 

CTTCACCACACC 

CACCATTCCTGG 

GCCTTCCACCAT 

GTACTACAAGGA 

TGGTCTGGAAAT 

AACAGCCTGCTG 
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GGATGATCCTGAG 
AAGAGGCCAATGA 
AGATATCCCCTCA 
GGTGCCTGTGGAG 
AGAATTACACACC 
AAGAGACAGGACA 
CACCCAGGGCTAC 
CTCCTGAGGGCCT 
GATATCCCCATGA 
GAGCCACTTTCTG 
GCTTGATCTACTC 
ATGACATATAAGG 
GGCCCAGGAAGAG 
GACCCCAGGTGCC 
AATGCCGATTGTG 
AGCCGTGAGCAGa 
TCACCAGCAGCAA 
GCCGAGCCTGCCG 
CGGCAAGTGGAGC 
CCACCGTGAGAGA 
GCCAGAGGACCCG 
CCCCTACGACGTG 
CGCC C T 
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5-prime Apal and 3-prime AscI sites are underlined. Initiating ATG and terminating 
TGA are in bold. 
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SEQ ID NO: 12 Artificial sequence - disrupted pol 

TPDKKHQKEPPFLWMGYELHPDKWTVQPIVLPEKD 
SWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRG 
TKALTEVI PLTEEAELELAENRE ILKEPVHGVYYD 
PSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYA 
RMRGAHTNDVKQLTEAVQKIATES IVI WGKTPKFK 
LPIQKETWEAWWTEYWQATWIPEWEFVNTPPLVKL 
WYQLEKEPIVGAETFPISPIETVPVKLKPGMDGPK 
VKQWPLTEEKIKALVEICTEMEKEGKISKIGPENP 
YNTPVFAIKKKDSTKWRKLVDFRELNKRTQDFWEV 
QLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFR 
KYTAFTI P S INNETPGIRYQYNVLPQGWKGS PAI F 
QSSMTKILEPFRKQNPDIVIYQYMDDLYVGSDLEI 
GQHRTKIEELRQHLLRWGFTTPDKKHQKEPPF 

SEQ ID NO; 13 Artificial sequence - disrupted gag 

MAPIVQNLQGQMVHQAISPRTLNAWVKVVEEKAFS 
PEVI PMF SALSEGATPQDLNTMLNTVGGHQAAMQM 
LKETINEEAAEWDRLHPVHAGPIAPGQMREPRGSD 
IAGTTSTLQEQIGWMTNNPPIPVGE IYKRWI ILGL 
NKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLR 
AEQASQEVKNWMTETLLVQNANPDCKTILKALGPA 
ATLEEMMTACQGVGGPGHKARVLMAARASVLSGGE 
LDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVN 
PGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTV 
ATLYCVHQRIEVKDTKEALEKIEEEQNKSKKKAQQ 
AAADTGNSSQVSQNY 

SEQ ID NO: 14 Kozak sequence for Gag p24 
GCC GCC ACC ATGG 
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